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METHODS OF DIAGNOSIS OF BREAST CANCER, COMPOSITIONS 
AND METHODS OF SCREENING FOR MODULATORS OF BREAST 

CANCER 

5 

CROSS-REFERENCES TO RELATED APPLICATIONS 

This application claims priority to USSN 60/263,965, filed January 24, 2001; 
TJSSN 60/265,928, filed February 2, 2001; USSN 09/829,472 filed April 9, 2001; USSN 
60/282,698, filed April 9, 2001; USSN 60/288,590, filed May 4, 2001; and USSN 
10 60/294,443, filed May 29, 2001, all of which are incorporated herein by reference in their 
entirety. 

FIELD OF THE INVENTION 

The invention relates to the identification of nucleic acid and protein 
1 5 expression profiles and nucleic acids, products, and antibodies thereto that are involved in 
breast cancer; and to the use of such expression profiles and compositions in the diagnosis, 
prognosis and therapy of breast cancer. The invention further relates to methods for 
identifying and using agents and/or targets that inhibit breast cancer. 

20 BACKGROUND OF THE INVENTION 

Breast cancer is one of the most frequently diagnosed cancers and the second 
leading cause of female cancer death in North America and northern Europe, with lung 
cancer being the leading cause. Lifetime incidence of the disease in the United States is one- 
in-eight, with a l-in-29 lifetime risk of dying from breast cancer. Early detection of breast 

25 cancer, using mammography, clinical breast examination, and self breast examination, has 
dramatically improved die treatment of the disease, although sensitivity is still major concern, 
as mammographic sensitivity has been estimated at only 60%-90%. Treatment of breast 
cancer consists largely of surgical lumpectomy or mastectomy, radiation therapy, anti- 
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hormone therapy, and/or chemotherapy. Although many breast cancer patients are effectively 
treated, the current therapies can all induce serious side effects which diminish quality of life. 
Deciding on a particular course of treatment is typically based on a variety of prognostic 
parameters and markers (Fitzgibbons et al., 2000, Arch. Pathol. Lab. Med. 124:966-978; 
5 Hamilton and Piccart, 2000, Ann. Oncol. 1 1 :647-663), including genetic predispostion 
markers BRCA-1 andBRCA-2 (Robson, 2000, J. Clin. Oncol. 18:113sup-U8sup). 

Imaging of breast cancer for diagnosis has been problematic and limited. In 
addition, dissemination of tumor cells (metastases) to locoregional lymph nodes is an 
important prognostic factor; five year survival rates drop from 80 percent in patients with no 

10 lymph node metastases to 45 to 50 percent in those patients who do have lymph node 

metastases. A recent report showed that micrometastases can be detected from lymph nodes 
using reverse transcriptase-PCR methods based on the presence of mRNA for 
carcinoembryonic antigen, which has previously been shown to be present in the vast 
majority of breast cancers but not in normal tissues. Liefers et al., New England J. of Med. 

15 339(4):223 (1998). 

The identification of novel therapeutic targets and diagnostic markers is 
essential for improving the current treatment of breast cancer patients. Recent advances in 
molecular medicine have increased the interest in tumor-specific cell surface antigens that 
could serve as targets for various immunotherapeutic or small molecule strategies. Antigens 

20 suitable for immunotherapeutic strategies should be highly expressed in cancer tissues and 
ideally not expressed in normal adult tissues. Expression in tissues that are dispensable for 
life, however, may be tolerated Examples of such antigens include Hei2/neu and the B-cell 
antigen CD20. Humanized monclonal antibodies directed to Her2/neu 
(Herceptin®/trastuzumab) are currently in use for the treatment of metastatic breast cancer 

25 (Ross and Fletcher, 1998, Stem Cells 16:413-428). Similarly, anti-CD20 monoclonal 
antibodies (Rituxm®/rituximab) are used to effectively treat non-Hodgekin's lymphoma 
(Maloney et al., 1997, Blood 90:2188-2195; Leget and Czuczman, 1998, Cuir. Opin. Oncol. 
10:548-551). 

Other potential immunotherapeutic targets have been identified for breast 
30 cancer. One such target is polymorphic epithelial mucin (MUC1). MUC1 is a transmembrane 
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protein, present at the apical surface of glandular epithelial cells. It is often overexpressed in 
breast cancer, and typically exhibits an altered glycosylation pattern, resulting in an 
antigenically distinct molecule, and is in early clinical trials as a vaccine target (Gilewski et 
al., 2000, Clin. Cancer Res. 6:1693-1701; Scholl et aL, 2000, J. Immunother. 23:570-580). 

5 The tumor-expressed protein is often cleaved into the circulation, where it is detectable as the 
tumor marker, CA 15-3 (Bon et al., 1997, Clin. Chem. 43:585-593). However, many patients 
have tumors that express neither HER2 nor MUC-1; therefore, it is clear that other targets 
need to be identified to manage localized and metastatic disease. Many other genes have been 
reported to be overexpressed in breast cancer, such as EGFR (Sainsbury et al., 1987, Lancet 

10 1(8547):1398-1402), c-etbB3 (Naidu et al., 1988, Br. L Cancer 78:1385-1390), FGFR2 
(Penault-Llorca et al., 1991, Int J. Cancer 61:170-176), PKW (Preiherr et al., 2000, 
Anticancer Res. 20:2255-2264), MTA1 (Nawa et al., 2000, J. Cell Biochem. 79:202-212), 
breast cancer associated gene 1 (Kurt et al., 2000, Breast Cancer Res. Treat 59:41-48). 
Although monoclonal antibodies to the protein products of some of these overexpressed 

1 5 genes have been reported (for review, see Green et al., 2000, Cancer Treat Rev. 26:269-286), 
none are currently approved for breast cancer therapy in the US. 

Disclosures of certain genes and ESTs described as bang expressed in breast 
cancer are found in international patent applications WO-99/33869, WO-97/25426, WO 
97/02280 and WO-00/55173, WO-98/45328 and WO-00/22130. Similarly, genes and ESTs 

20 described as being expressed in breast cancer are disclosed in US Patent Nos. 5,759,776 and 
5,693,522. The utility of such genes is described in each of these publications, and their 
disclosures are incorporated herein in their entirety. 

While industry and academia have identified novel sequences, there has not 
been an equal effort exerted to identify the function of these novel sequences. The 

25 elucidation of a role for novel proteins and compounds in disease states for identification of 
therapeutic targets and diagnostic markers is essential for improving the current treatment of 
breast cancer patients. Accordingly, provided herein are molecular targets for therapeutic 
intervention in breast and other cancers. Additionally, provided herein are methods that can 
be used in diagnosis and prognosis of breast cancer. Further provided are methods that can 

30 be used to screen candidate bioactive agents for the ability to modulate breast cancer. 
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SUMMARY OF THE INVENTION 
The present invention therefore provides nucleotide sequences of genes that 
are up- and down-regulated in breast cancer cells. Such genes are useful for diagnostic 
5 purposes, and also as targets for screening for therapeutic compounds that modulate breast 
cancer, such as hormones or antibodies. Other aspects of the invention will become apparent 
to the skilled artisan by the following description of the invention. 

In one aspect, the present invention provides a method of detecting a breast 
cancer-associated transcript in a cell from a patient, the method comprising contacting a 
10 biological sample from the patient with a polynucleotide that selectively hybridizes to a 
sequence at least 80% identical to a sequence as shown in Tables 1-25. 

In one embodiment, the present invention provides a method of determining 
the level of a breast cancer associated transcript in a cell from a patient. 

In one embodiment, the present invention provides a method of detecting a 
1 5 breast cancer-associated transcript in a cell from a patient, the method comprising contacting 
a biological sample from the patient with a polynucleotide that selectively hybridizes to a 
sequence at least 80% identical to a sequence as shown in Tables 1-2S. 

In one embodiment, the polynucleotide selectively hybridizes to a sequence at 
least 95% identical to a sequence as shown in Tables 1-25. 
20 In one embodiment, the biological sample is a tissue sample, hi another 

embodiment, the biological sample comprises isolated nucleic acids, e.g., mRNA. 

In one embodiment, the polynucleotide is labeled, e.g., with a fluorescent 

label. 

In one embodiment, the polynucleotide is immobilized on a solid surface. 
25 In one embodiment, the patient is undergoing a therapeutic regimen to treat 

breast cancer. In another embodiment, the patient is suspected of having metastatic breast 
cancer. 

In one embodiment, the patient is a human. 

In one embodiment, the breast cancer associated transcript is mRNA. 
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In one embodiment, the method further comprises the step of amplifying 
nucleic acids before the step of contacting the biological sample with the polynucleotide. 

In another aspect, the present invention provides a method of monitoring the 
efficacy of a therapeutic treatment of breast cancer, the method comprising the steps of: (i) 
5 providing a biological sample from a patient undergoing the therapeutic treatment; and (ii) 
determining the level of a breast cancer-associated transcript in the biological sample by 
contacting the biological sample with a polynucleotide that selectively hybridizes to a 
sequence at least 80% identical to a sequence as shown in Tables 1-25, thereby monitoring 
the efficacy of the therapy. In a further embodiment, the patient has metastatic breast cancer. 
10 In a further embodiment, the patient has a drug resistant form of breast cancer. 

In one embodiment, the method further comprises the step of: (iii) comparing 
the level of the breast cancer-associated transcript to a level of the breast cancer-associated 
transcript in a biological sample from the patient prior to, or earlier in, the therapeutic 
treatment. 

15 Additionally, provided herein is a method of evaluating the effect of a 

candidate breast cancer drug comprising administering the drug to a patient and removing a 
cell sample from the patient The expression profile of the cell is then determined. This 
method may further comprise comparing the expression profile to an expression profile of a 
healthy individual. In a preferred embodiment, said expression profile includes a gene of 

20 Tables 1-25. 

In one aspect, the present invention provides an isolated nucleic acid molecule 
consisting of a polynucleotide sequence as shown in Tables 1-25. 

In one embodiment, an expression vector or cell comprises the isolated nucleic 

acid. 

25 In one aspect, the present invention provides an isolated polypeptide which is 

encoded by a nucleic acid molecule having polynucleotide sequence as shown in Tables 1-25. 

In another aspect, the present invention provides an antibody that specifically 
hinds to an isolated polypeptide which is encoded by a nucleic acid molecule having 
polynucleotide sequence as shown in Tables 1-25. 
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In one embodiment, the antibody is conjugated to an effector component, e.g., 
a fluorescent label, a radioisotope or a cytotoxic chemical. 

In one embodiment, the antibody is an antibody fragment. In another 
embodiment, the antibody is humanized. 
5 In one aspect, the present invention provides a method of detecting a breast 

cancer cell in a biological sample from a patient, the method comprising contacting the 
biological sample with an antibody as described herein. 

In another aspect, the present invention provides a method of detecting 
antibodies specific to breast cancer in a patient, the method comprising contacting a 
1 0 biological sample from the patient with a polypeptide encoded by a nucleic acid comprising a 
sequence from Tables 1-25. 

In another aspect, the present invention provides a method for identifying a 
compound that modulates a breast cancer-associated polypeptide, the method comprising the 
steps of: (i) contacting the compound with a breast cancer-associated polypeptide, the 
1 5 polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence at least 
80% identical to a sequence as shown in Tables 1-25; and (ii) determining the functional 
effect of the compound upon the polypeptide. 

In one embodiment, the functional effect is a physical effect, an enzymatic 
effect, or a chemical effect 
20 In one embodiment, the polypeptide is expressed in a eukaryotic host cell or 

cell membrane. In another embodiment, the polypeptide is recombinant 

In one embodiment, the functional effect is determined by measuring ligand 
binding to the polypeptide. 

In another aspect, the present invention provides a method of inhibiting 
25 proliferation of a breast cancer-associated cell to treat breast cancer in a patient, the method 
comprising the step of administering to the subject a therapeutically effective amount of a 
compound identified as described herein. 

In one embodiment, the compound is an antibody. 
In another aspect, the present invention provides a drug screening assay 
30 comprising the steps of: (i) administering a test compound to a mammal having breast cancer 

6 
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or to a cell sample isolated therefrom; (ii) comparing the level of gene expression of a 
polynucleotide that selectively hybridizes to a sequence at least 80% identical to a sequence 
as shown in Tables 1-25 in a treated cell or mammal with the level of gene expression of the 
polynucleotide in a control cell sample or mammal, wherein a test compound that modulates 
5 the level of expression of the polynucleotide is a candidate for the treatment of breast cancer. 

In one embodiment, the control is a mammal with breast cancer or a cell 
sample therefrom that has not been treated with the test compound. In another embodiment, 
the control is a normal cell or mammal. 

In one embodiment, the test compound is administered in varying amounts or 
10 concentrations. In another embodiment, the test compound is administered for varying time 
periods. In another embodiment, the comparison can occur after addition or removal of the 
drug candidate. 

In one embodiment, the levels of a plurality of polynucleotides that selectively 
hybridize to a sequence at least 80% identical to a sequence as shown in Tables 1-25 are 
15 individually compared to their respective levels in a control cell sample or mammal. In a 
preferred embodiment the plurality of polynucleotides is from three to ten. 

In another aspect, the present invention provides a method for treating a 
mammal having breast cancer comprising administering a compound identified by the assay 
described herein. 

20 In another aspect, the present invention provides a pharmaceutical 

composition for treating a mammal having breast cancer, the composition comprising a 
compound identified by the assay described herein and a physiologically acceptable 
excipient. 

In one aspect, the present invention provides a method of screening drug 
25 candidates by providing a cell expressing a gene that is up- and down-regulated as in a breast 
cancer. In one embodiment, a gene is selected from Tables 1-25. The method further 
includes adding a drug candidate to the cell and determining the effect of the drug candidate 
on the expression of the expression profile gene. 

In one embodiment, the method of screening drug candidates includes 
30 comparing the level of expression in the absence of the drug candidate to the level of 
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expression in the presence of the drug candidate, wherein the concentration of the drug 
candidate can vary when present, and wherein the comparison can occur after addition or 
removal of the drug candidate. In a preferred embodiment, the cell expresses at least two 
expression profile genes. The profile genes may show an increase or decrease. 
5 Also provided is a method of evaluating the effect of a candidate breast cancer 

drug comprising administering the drug to a transgenic animal expressing or over-expressing 
the breast cancer modulatory protein, or an animal lacking the breast cancer modulatory 
protein, for example as a result of a gene knockout. 

Moreover, provided herein is a biochip comprising one or more nucleic acid 

1 0 segments of Tables 1-25, wherein the biochip comprises fewer than 1000 nucleic acid probes. 
Preferably, at least two nucleic acid segments are included. More preferably, at least three 
nucleic acid segments are included 

Furthermore, a method of diagnosing a disorder associated with breast cancer 
is provided. The method comprises determining the expression of a gene of Tables 1-25, 

1 5 preferably a gene of Table 25, in a first tissue type of a first individual, and comparing the 
distribution to the expression of the gene from a second normal tissue type from the first 
individual or a second unaffected individual. A difference in the expression indicates that the 
first individual has a disorder associated with breast cancer. 

In a further embodiment, the biochip also includes a polynucleotide sequence 

20 of a gene that is not up- and down-regulated in breast cancer. 

In one embodiment a method for screening for a bioactive agent capable of 
interfering with the binding of a breast cancer modulating protein (breast cancer modulatory 
protein) or a fragment thereof and an antibody which binds to said breast cancer modulatory 
protein or fragment thereof In a preferred embodiment, the method comprises combining a 

25 breast cancer modulatory protein or fragment thereof, a candidate bioactive agent and an 
antibody which binds to said breast cancer modulatory protein or fragment thereof. The 
method further includes determining the binding of said breast cancer modulatory protein or 
fragment thereof and said antibody. Wherein there is a change in binding, an agent is 
identified as an interfering agent The interfering agent can be an agonist or an antagonist 

30 Preferably, the agent inhibits breast cancer. 
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Also provided herein are methods of eliciting an immune response in an 
individual. In one embodiment a method provided herein comprises administering to an 
individual a composition comprising a breast cancer modulating protein, or a fragment 
thereof In another embodiment, the protein is encoded by a nucleic acid selected from those 
5 of Tables 1-25. 

Further provided herein are compositions capable of eliciting an immune 
response in an individual. In one embodiment, a composition provided herein comprises a 
breast cancer modulating protein, preferably encoded by a nucleic acid of Tables 1-25, more 
preferably of Table 25, or a fragment thereof and a pharmaceutically acceptable carrier. In 
10 another embodiment, said composition comprises a nucleic acid comprising a sequence 
encoding a breast cancer modulating protein, preferably selected from the nucleic acids of 
Tables 1-25, and a pharmaceutically acceptable carrier. 

Also provided are methods of neutralizing the effect of a breast cancer protein, 
or a fragment thereof, comprising contacting an agent specific for said protein with said 
1 5 protein in an amount sufficient to effect neutralization. In another embodiment, the protein is 
encoded by a nucleic acid selected from thoseof Tables 1-25. 

In another aspect of the invention, a method of treating an individual for breast 
cancer is provided. In one embodiment, the method comprises administering to said 
individual an inhibitor of a breast cancer modulating protein. In another embodiment, the 
20 method comprises administering to a patient having breast cancer an antibody to a breast 

cancer modulating protein conjugated to a therapeutic moiety. Such a therapeutic moiety can 
be a cytotoxic agent or a radioisotope. 



DETAILED DESCRIPTION OF THE INVENTION 
25 In accordance with the objects outlined above, the present invention provides 

novel methods for diagnosis and prognosis evaluation for breast cancer (PC), including 
metastatic breast cancer, as well as methods for screening for compositions which modulate 
breast cancer. Also provided are methods for treating breast cancer. 

Tables 1-24B provide unigene cluster identification numbers for the 
30 nucleotide sequence of genes that exhibit increased or decreased expression in breast cancer 

9 
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samples. Tables 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 18, 19, 20, 21, and 22 list those genes that are 
up-regulated in breast cancer cells. Table 14 lists those genes that are highly upregulated in 
breast cancer cells. Table 1, 2, 3, 15, and 23 list genes that are down-regulated in breast 
cancer cells and Table 16, lists genes that are highly down-regulated in breast cancer genes. 
5 The Tables also provide an exemplar accession number that provides a nucleotide sequence 
that is part of the unigene cluster. 

Definitions 

The term '"breast cancer protein" or "breast cancer polynucleotide" or c 'breast 

1 0 cancer-associated transcript" refers to nucleic acid and polypeptide polymorphic variants, 
alleles, mutants, and interspecies homologies that: (1) have a nucleotide sequence that has 
greater than about 60% nucleotide sequence identity, 65%, 70%, 75%, 80%, 85%, 90%, 
preferably 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% or greater nucleotide 
sequence identity, preferably over a region of over a region of at least about 25, 50, 100, 200, 

15 500, 1000, or more nucleotides, to a nucleotide sequence of or associated with a gene of 

Tables 1-25; (2) bind to antibodies, e.g., polyclonal antibodies, raised against an immunogen 
comprising an amino acid sequence encoded by a nucleotide sequence of or associated with a 
gene of Tables 1-25, and conservatively modified variants thereof, (3) specifically hybridize 
under stringent hybridization conditions to a nucleic acid sequence, or the complement 

20 thereof of Tables 1-25 and conservatively modified variants thereof or (4) have an amino acid 
sequence that has greater than about 60% amino acid sequence identity, 65%, 70%, 75%, 
80%, 85%, 90%, preferably 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% or greater 
amino sequence identity, preferably over a region of over a region of at least about 25, 50, 
100, 200, 500, 1000, or more amino acid, to an amino acid sequence encoded by a nucleotide 

25 sequence of or associated with a gene of Tables 1 -25 . A polynucleotide or polypeptide 
sequence is typically from a mammal including, but not limited to, primate, e.g., human; 
rodent, e.g., rat, mouse, hamster; cow, pig, horse, sheep, or other mammal. A "breast cancer 
polypeptide" and a "breast cancer polynucleotide," include both naturally occurring or 
recombinant forms. 

10 
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A "fall length" breast cancer protein or nucleic acid refers to a breast cancer 
polypeptide or polynucleotide sequence, or a variant thereof, that contains all of the elements 
normally contained in one or more naturally occurring, wild type breast cancer 
polynucleotide or polypeptide sequences. The "fall length" may be prior to, or after, various 
5 stages of post-translation processing or splicing, including alternative splicing. 

"Biological sample'* as used herein is a sample of biological tissue or fluid that 
contains nucleic acids or polypeptides, e.g., of a breast cancer protein, polynucleotide or 
transcript. Such samples include, but are not limited to, tissue isolated from primates, e.g., 
humans, or rodents, e.g., mice, and rats. Biological samples may also include sections of 

10 tissues such as biopsy and autopsy samples, frozen sections taken for histologic purposes, 
blood, plasma, serum, sputum, stool, tears, mucus, hair, skin, etc* Biological samples also 
include explants and primary and/or transformed cell cultures derived from patient tissues. A 
biological sample is typically obtained from a eukaryotic organism, most preferably a 
mammal such as a primate e.g., chimpanzee or human; cow; dog; cat; a rodent, e.g., guinea 

1 5 pig, rat, mouse; rabbit; or a bird; reptile; or fish. 

"Providing a biological sample" means to obtain a biological sample for use in 
methods described in tins invention. Most often, this will be done by removing a sample of 
cells from an animal, but can also be accomplished by using previously isolated cells (e.g. 9 
isolated by another person, at another time, and/or for another purpose), or by performing the 

20 methods of the invention in vivo. Archival tissues, having treatment or outcome history, will 
be particularly useful. 

The terms "identical" or percent "identity," in the context of two or more 
nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences that 
are the same or have a specified percentage of amino acid residues or nucleotides that are the 

25 same (i.e., about 60% identity, preferably 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 
95%, 96%, 97%, 98%, 99%, or higher identity over a specified region, when compared and 
aligned for maximum correspondence over a comparison window or designated region) as 
measured using a BLAST or BLAST 2.0 sequence comparison algorithms with default 
parameters described below, or by manual alignment and visual inspection {see, e.g., NCBI 

30 web site ht^:/Avww.ncbi.nlm.nih.gov/BLAST/ or the like). Such sequences are then said to 

11 
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be "substantially identical." This definition also refers to, or may be applied to, the 
compliment of a test sequence. The definition also includes sequences that have 'deletions 
and/or additions, as well as those that have substitutions, as well as naturally occurring, e.g., 
polymorphic or allelic variants, and man-made variants. As described below, the preferred 
5 algorithms can account for gaps and the like. Preferably, identity exists over a region that is 
at least about 25 amino acids or nucleotides in length, or more preferably over a region that is 
50-1 00 amino acids, or nucleotides in length. 

For sequence comparison, typically one sequence acts as a reference sequence, 
to which test sequences are compared. When using a sequence comparison algorithm, test 

1 0 and reference sequences are entered into a computer, subsequence coordinates are designated, 
if necessary, and sequence algorithm program parameters are designated. Preferably, default 
program parameters can be used, or alternative parameters can be designated. The sequence 
comparison algorithm then calculates the percent sequence identities for the test sequences 
relative to the reference sequence, based on the program parameters. 

15 A "comparison window**, as used herein, includes reference to a segment of 

one of the number of contiguous positions selected from the group consisting typically of 
from 20 to 600, usually about 50 to about 200, more usually about 100 to about 150 in which 
a sequence may be compared to a reference sequence of the same number of contiguous 
positions after the two sequences are optimally aligned. Methods of alignment of sequences 

20 for comparison are well-known in the art Optimal alignment of sequences for comparison 
can be conducted, e.g., by the local homology algorithm of Smith & Waterman, Adv. AppL 
Math. 2:482 (1981), by the homology alignment algorithm of Needleman & Wunsch, /. Mol 
Biol 48:443 (1970), by the search for similarity method of Pearson & Lipman, Proa Nat'L 
Acad. Sci. USA 85:2444 (1988), by computerized implementations of these algorithms 

25 (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, 
Genetics Computer Group, 575 Science Dr., Madison, WI), or by manual alignment and 
visual inspection (see, e*g., Current Protocols in Molecular Biology (Ausubel et aL, eds. 
1995 supplement)). 

Preferred examples of algorithms that are suitable for determining percent 

30 sequence identity and sequence similarity include the BLAST and BLAST 2.0 algorithms, 

12 
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which are described in Altschul et al, Nuc. Acids Res. 25:3389-3402 (1977) and Altschul et 
aL, J. Mol. Biol 215:403-410 (1990). BLAST and BLAST 2.0 are used, with the parameters 
described herein, to determine percent sequence identity for the nucleic acids and proteins of 
the invention. Software for performing BLAST analyses is publicly available through the 
5 National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/)- This 

algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short 
words of length Win the query sequence, which either match or satisfy some positive-valued 
threshold score T when aligned with a word of the same length in a database sequence. T is 
referred to as the neighborhood word score threshold (Altschul et aL 9 supra). These initial 

10 neighborhood word hits act as seeds for initiating searches to find longer HSPs containing 
them. The word hits are extended in both directions along each sequence for as far as the 
cumulative alignment score can be increased Cumulative scores are calculated using, e.g., 
for nucleotide sequences, the parameters M (reward score for a pair of matching residues; 
always > 0) and N (penalty score for mismatching residues; always < 0). For amino acid 

15 sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word 
hits in each direction are halted when: the cumulative alignment score falls off by the 
quantity X from its maximum achieved value; the cumulative score goes to zero or below, 
due to the accumulation of one or more negative-scoring residue alignments; or the end of 
either sequence is reached. The BLAST algorithm parameters W, T, and X determine the 

20 sensitivity and speed of the alignment The BLASTN program (for nucleotide sequences) 
uses as defaults a wordlength (W) of 11, an expectation (E) of 10, M=5,N=-4 and a 
comparison of both strands. For amino acid sequences, the BLASTP program uses as 
defaults a wordlength of 3, and expectation (E) of 10, and the BLOSUM62 scoring matrix 
(see Henikoff & Henikoft Proc. Natl Acad. Sci USA 89:10915 (1989)) alignments (B) of 

25 50, expectation (E) of 10, M=5, N=-4, and a comparison of both strands. 

The BLAST algorithm also performs a statistical analysis of the similarity 
between two sequences (see, e.g., Karlin & Altschul, Proa Natl Acad. Set USA 90:5873- 
5787 (1993)). One measure of similarity provided by the BLAST algorithm is the smallest 
sum probability (P(N)), which provides an indication of the probability by which a match 

30 between two nucleotide or amino acid sequences would occur by chance. For example, a 
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nucleic acid is considered similar to a reference sequence if the smallest sum probability in a 
comparison of the test nucleic acid to the reference nucleic acid is less than about 0.2, more 
preferably less than about 0.01, and most preferably less than about 0.001 . Log values may 
be large negative numbers, e.g., 5, 10, 20, 30, 40, 40, 70, 90, 1 10, 150, 170, etc, 
5 An indication that two nucleic acid sequences or polypeptides are substantially 

identical is that the polypeptide encoded by the first nucleic acid is immunologically cross 
reactive with the antibodies raised against the polypeptide encoded by the second nucleic 
acid, as described below. Thus, a polypeptide is typically substantially identical to a second 
polypeptide, e.g., where the two peptides differ only by conservative substitutions. Another 

10 indication that two nucleic acid sequences are substantially identical is that the two molecules 
or their complements hybridize to each other under stringent conditions, as described below. 
Yet another indication that two nucleic acid sequences are substantially identical is that the 
same primers can be used to amplify the sequences. 

A "host cell" is a naturally occurring cell or a transformed cell that contains an 

1 5 expression vector and supports the replication or expression of the expression vector. Host 
cells may be cultured cells, explants, cells in vrvo, and the like. Host cells may be 
prokaryotic cells such as E. coli, or eukaryotic cells such as yeast, insect, amphibian, or 
mammalian cells such as CHO, HeLa, and the like (see, eg,, the American Type Culture 
Collection catalog or web site, www.atcc.org). 

20 The terms 'isolated,** purified/* or "biologically pure" refer to material that is 

substantially or essentially free from components that normally accompany it as found in its 
native state. Purity and homogeneity are typically determined using analytical chemistry 
techniques such as polyacrylamide gel electrophoresis or high performance liquid 
chromatography. A protein or nucleic acid that is the predominant species present in a 

25 preparation is substantially purified. In particular, an isolated nucleic acid is separated from 
some open reading frames that naturally flank the gene and encode proteins other than protein 
encoded by the gene. The term "purified" in some embodiments denotes that a nucleic acid 
or protein gives rise to essentially one band in an electrophoretic gel. Preferably, it means 
that the nucleic acid or protein is at least 85% pure, more preferably at least 95% pure, and 

30 most preferably at least 99% pure. "Purify" or "purification" in other embodiments means 
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removing at least one contaminant from the composition to be purified. In this sense, 
purification does not require that the purified compound be homogenous, e.g., 100% pure. 

The terms polypeptide," peptide" and protein" are used interchangeably 
herein to refer to a polymer of amino acid residues. The terms apply to amino acid polymers 
5 in which one or more amino acid residue is an artificial chemical mimetic of a corresponding 
naturally occurring amino acid, as well as to naturally occurring amino acid polymers, those 
containing modified residues, and non-naturally occurring amino acid polymer. 

The term "amino acid" refers to naturally occurring and synthetic amino acids, 
as well as amino acid analogs and amino acid mimetics that function similarly to the naturally 

10 occurring amino acids. Naturally occurring amino acids are those encoded by the genetic 
code, as well as those amino acids that are later modified, e.g., hydroxyproline, y- 
carboxyglutamate, and O-phosphoserine. Amino acid analogs refers to compounds that have 
the same basic chemical structure as a naturally occurring amino acid, e.g., an a carbon that is 
bound to a hydrogen, a carboxyl group, an amino group, and an R group, e.g., homoserine, 

1 5 norleucine, methionine sulfoxide, methionine methyl sulfonium. Such analogs may have 

modified R groups (e.g., norleucine) or modified peptide backbones, but retain the same basic 
chemical structure as a naturally occurring amino acid. Amino acid mimetics refers to 
chemical compounds that have a structure that is different from the general chemical 
structure of an amino acid, but that functions similarly to a naturally occurring amino acid. 

20 Amino acids may be referred to herein by either their commonly known three 

letter symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical 
Nomenclature Commission. Nucleotides, likewise, may be referred to by their commonly 
accepted single-letter codes. 

"Conservatively modified variants" applies to both amino acid and nucleic 

25 acid sequences. With respect to particular nucleic acid sequences, conservatively modified 
variants refers to those nucleic acids which encode identical or essentially identical amino 
acid sequences, or where the nucleic acid does not encode an amino acid sequence, to 
essentially identical or associated, e.g., naturally contiguous, sequences. Because of the 
degeneracy of the genetic code, a large number of functionally identical nucleic acids encode 

30 most proteins. For instance, the codons GCA, GCC, GCG and GCU all encode the amino 
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acid alanine. Thus, at every position where an alanine is specified by a codon, the codon can 
be altered to another of the corresponding codons described without altering the encoded 
polypeptide. Such nucleic acid variations are "silent variations," which are one species of 
conservatively modified variations. Every nucleic acid sequence herein which encodes a 
5 polypeptide also describes silent variations of the nucleic acid. One of skill will recognize 
that in certain contexts each codon in a nucleic acid (except AUG, which is ordinarily the 
only codon for methionine, and TGG, which is ordinarily the only codon for tryptophan) can 
be modified to yield a functionally identical molecule. Accordingly, often silent variations of 
a nucleic acid which encodes a polypeptide is implicit in a described sequence with respect to 

10 the expression product, but not with respect to actual probe sequences. 

As to amino acid sequences, one of skill will recognize that individual 
substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein 
sequence which alters, adds or deletes a single amino acid or a small percentage of amino 
acids in the encoded sequence is a "conservatively modified variant" where the alteration 

IS results in the substitution of an amino acid with a chemically similar amino acid. 

Conservative substitution tables providing functionally similar amino acids are well known in 
the art. Such conservatively modified variants are in addition to and do not exclude 
polymorphic variants, interspecies homologs, and alleles of the invention. typically 
conservative substitutions for one another: 1) Alanine (A), Glycine (G); 2) Aspartic acid (D), 

20 Glutamic acid (E); 3) Asparagine (N), Glutamine (Q); 4) Alanine (R), Lysine (K); 5) 

Isoleucine (I), Leucine (L), Methionine (M), Valine (V); 6) Phenylalanine (F), Tyrosine (Y), 
Tryptophan (W); 7) Serine (S), Threonine (T); and 8) Cysteine (Q, Methionine (M) {see, 
e.g, Creighton, Protons (1984)). 

Macromolecular structures such as polypeptide structures can be described in 

25 terms of various levels of organization. For a general discussion of this organization, see, 
eg, Alberts et al 9 Molecular Biology of the Cell (3 rf ed., 1994) and Cantor & Schimmel, 
Biophysical Chemistry Part I: The Conformation of Biological Macromolecules (1980). 
"Primary structure*' refers to the amino acid sequence of a particular peptide. "Secondary 
structure" refers to locally ordered, three dimensional structures within a polypeptide. These 

30 structures are commonly known as domains. Domains are portions of a polypeptide that 
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often form a compact unit of the polypeptide and are typically 25 to approximately 500 
amino acids long. Typical domains are made up of sections of lesser organization such as 
stretches of p-sheet and a-helices. 'Tertiary structure" refers to the complete three 
dimensional structure of a polypeptide monomer. "Quaternary structure" refers to the three 
5 dimensional structure formed, usually by the noncovalent association of independent tertiary 
units. Anisotropic terms are also known as energy terms. 

''Nucleic acid" or "oligonucleotide" or "polynucleotide" or grammatical 
equivalents used herein means at least two nucleotides covalently linked together. 
Oligonucleotides are typically from about 5, 6, 7, 8, 9, 10, 12, 15, 25, 30, 40, 50 or more 

1 0 nucleotides in length, up to about 1 00 nucleotides in length. Nucleic acids and 

polynucleotides are a polymers of any length, including longer lengths, e.g., 200, 300, 500, 
1000, 2000, 3000, 5000, 7000, 10,000, etc. A nucleic acid of the present invention will 
generally contain phosphodiester bonds, although in some cases, nucleic acid analogs are 
included that may have alternate backbones, comprising, e.g., phosphoramidate, 

15 phosphorothioate, phosphorodithioate, or O-methylphophoroamidite linkages (see Eckstein, 
Oligonucleotides and Analogues: A Practical Approach, Oxford University Press); and 
peptide nucleic acid backbones and linkages. Other analog nucleic acids include those with 
positive backbones; non-ionic backbones, and non-ribose backbones, including those 
described in U.S. Patent Nos. 5,235,033 and 5,034,506, and Chapters 6 and 7, ASC 

20 Symposium Series 580, Carbohydrate Modifications in Jintisense Research, Sanghui & 

Cook, eds.. Nucleic acids containing one or more carbocyclic sugars are also included within 
one definition of nucleic acids. Modifications of the ribose-phosphate backbone may be done 
for a variety of reasons, e.g. to increase the stability and half-life of such molecules in 
physiological environments or as probes on a biochip. Mixtures of naturally occurring 

25 nucleic acids and analogs can be made; alternatively, mixtures of different nucleic acid 
analogs, and mixtures of naturally occurring nucleic acids and analogs may be made. 

A variety of references disclose such nucleic add analogs, including, for 
example, phosphoramidate (Beaucage et al., Tetrahedron 49(10):1925 (1993) and references 
therein; Letsinger, J. Org. Chem. 35:3800 (1970); Sprinzl et al., Eur. J. Biochem. 81:579 

30 (1977); Letsinger et al., Nucl. Acids Res. 14:3487 (1986); Sawai et al, Chem. Lett. 805 
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(1984), Letsinger et al., J. Am. Chem. Soc. 1 10:4470 (1988); and Pauwels et al., Chemica 
Scripta 26:141 91986)), phosphorothioate (Mag et aL, Nucleic Acids Res. 19:1437 (1991); 
and U.S. Patent No. 5,644,048), phosphorodithioate (Briu et al., J. Am. Chem. Soc. 1 1 1 :2321 
(1989), O-methylphophoroamidite linkages (see Eckstein, Oligonucleotides and Analogues: 
5 A Practical Approach, Oxford University Press), and peptide nucleic acid backbones and 
linkages (see Egholm, J. Am. Chem. Soc. 1 14:1895 (1992); Meier et al., Chem. Int Ed Engl. 
31:1008 (1992); Nielsen, Nature, 365:566 (1993); Carlsson et al., Nature 380:207 (1996), all 
of which are incorporated by reference). Other analog nucleic acids include those with 
positive backbones (Denpcy et al., Proc. Natl. Acad Sci. USA 92:6097 (1995); non-ionic 

10 backbones (U.S. Patent Nos. 5,386,023, 5,637,684, 5,602,240, 5,216,141 and 4,469,863; 
Kiedrowshi et al., Angew. Chem. Intl. Ed. English 30:423 (1991); Letsinger et al., J. Am. 
Chem. Soc. 1 10:4470 (1988); Letsinger et al., Nucleoside & Nucleotide 13:1597 (1994); 
Chapters 2 and 3, ASC Symposium Series 580, "Carbohydrate Modifications in Antisense 
Research", Ed. Y.S. Sanghui and P. Dan Cook; Mesmaeker et al., Bioorganic & Medicinal 

15 Chem. Lett 4:395 (1994); Jeffe et al., J. Biomolecular NMR 34:17 (1994); Tetrahedron Lett. 
37:743 (1996)) and non-ribose backbones, including those described in U.S. Patent Nos. 
5,235,033 and 5,034,506, and Chapters 6 and 7, ASC Symposium Series 580, "Carbohydrate 
Modifications in Antisense Research", Ed. Y.S. Sanghui and P. Dan Cook. Nucleic acids 
containing one or more carbocyclic sugars are also included within one definition of nucleic 

20 acids (see Jenkins et aL, Chem. Soc. Rev. (1995) pp 169-176). Several nucleic acrid analogs 
are described in Rawls, C&E News June 2, 1997 page 35. All of these references are hereby 
expressly incorporated by reference. 

Particularly preferred are peptide nucleic acids (PNA) which includes peptide 
nucleic acid analogs. These backbones are substantially non-ionic under neutral conditions, in 

25 contrast to the highly charged phosphodiester backbone of naturally occurring nucleic acids. 
This results in two advantages. First, the PNA backbone exhibits improved hybridization 
kinetics. PNAs have larger changes in the melting temperature (TnO for mismatched versus 
perfectly matched basepairs. DNA and RNA typically exhibit a 2-4°C drop in T m for an 
internal mismatch. With the non-ionic PNA backbone, the drop is closer to 7-9°C. Similarly, 

30 due to their non-ionic nature, hybridization of the bases attached to these backbones is 
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relatively insensitive to salt concentration. In addition, PNAs are not degraded by cellular 
enzymes, and thus can be more stable. 

The nucleic acids may be single stranded or double stranded, as specified, or 
contain portions of both double stranded or single stranded sequence. As will be appreciated 
5 by those in the art, the depiction of a single strand also defines the sequence of die 

complementary strand; thus the sequences described herein also provide the complement of 
the sequence. The nucleic acid may be DNA, both genomic and cDNA, RNA or a hybrid, 
where the nucleic acid may contain combinations of deoxyribo- and ribo-nucleotides, and 
combinations of bases, including uracil, adenine, thymine, cytosine, guanine, inosine, 

10 xanthine hypoxanthine, isocytosine, isoguanine, etc. 'Transcript" typically refers to a 

naturally occurring RNA, e.g., a pre-mRNA, hnRNA, or mRNA. As used herein, the term 
"nucleoside" includes nucleotides and nucleoside and nucleotide analogs, and modified 
nucleosides such as amino modified nucleosides. In addition, '^nucleoside" includes non- 
naturally occurring analog structures. Thus, e.g. the individual units of a peptide nucleic 

15 acid, each containing a base, are referred to herein as a nucleoside. 

A "label" or a "detectable moiety" is a composition detectable by 
spectroscopic, photochemical, biochemical, immunochemical, chemical, or other physical 
means. For example, useful labels include 32 P, fluorescent dyes, electron-dense reagents, 
enzymes (e.g., as commonly used in an ELISA), biotin, digoxigenin, or haptens and proteins 

20 or other entities which can be made detectable, by incorporating a radiolabel into the 
peptide or used to detect antibodies specifically reactive with the peptide. The labels may be 
incorporated into the breast cancer nucleic acids, proteins and antibodies at any position. 
Any method known in the art for conjugating the antibody to the label may be employed, 
including those methods described by Hunter et al., Nature, 144:945 (1962); David et al., 

25 Biochemistry. 13:1014 (1974); Pain et al., T Immunol. Meth.. 40:219 (1981); andNygren, L 
Histochem. and Cvtochenu 30:407 (1982). 

An "effector* * or "effector moiety" or "effector component" is a molecule that 
is bound (or linked, or conjugated), either covalently, through a linker or a chemical bond, or 
noncovalently, through ionic, van der Waals, electrostatic, or hydrogen bonds, to an antibody. 

30 The "effector** can be a variety of molecules including, e.g., detection moieties including 
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radioactive compounds, fluorescent compounds, an enzyme or substrate, tags such as epitope 
tags, a toxin; activatable moieties, a chemotherapeutic agent; a lipase; an antibiotic; or a 
radioisotope emitting "hard" e.g., beta radiation. 

A "labeled nucleic acid probe or oligonucleotide" is one that is bound, either 

5 covalently, through a linker or a chemical bond, or noncovalently, through ionic, van der 
Waals, electrostatic, or hydrogen bonds to a label such that the presence of the probe may be 
detected by detecting the presence of the label bound to the probe. Alternatively, method 
using high affinity interactions may achieve the same results where one of a pair of binding 
partners binds to the other, e.g., biotin, streptavidin. 

10 As used herein a **nucleic acid probe or oligonucleotide" is defined as a 

nucleic acid capable of binding to a target nucleic acid of complementary sequence through 
one or more types of chemical bonds, usually through complementary base pairing, usually 
through hydrogen bond formation. As used herein, a probe may include natural (i.e., A, G, C, 
or T) or modified bases (7-deazaguanosine, inosine, etc.). In addition, the bases in a probe 

1 5 may be joined by a linkage other than a phosphodiester bond, so long as it does not 

functionally interfere with hybridization. Thus, e.g., probes may be peptide nucleic acids in 
which the constituent bases are joined by peptide bonds rather than phosphodiester linkages. 
It will be understood by one of skill in the art that probes may bind target sequences lacking 
complete complementarity with the probe sequence depending upon the stringency of the 

20 hybridization conditions. The probes are preferably directly labeled as with isotopes, 

chromophores, lumiphores, chromogens, or indirectly labeled such as with biotin to which a 
streptavidin complex may later bind. By assaying for the presence or absence of the probe, 
one can detect the presence or absence of the select sequence or subsequence. Diagnosis or 
prognosis may be based at the genomic level, or at the level of RNA or protein expression. 

25 Hie term Recombinant" when used with reference, e.g., to a cell, or nucleic 

acid, protein, or vector, indicates that the cell, nucleic acid, protein or vector, has been 
modified by the introduction of a heterologous nucleic acid or protein or the alteration of a 
native nucleic acid or protein, or that the cell is derived from a cell so modified. Thus, e.g., 
recombinant cells express genes that are not found within the native (non-recombinant) form 

30 of the cell or express native genes that are otherwise abnormally expressed, under expressed 
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or not expressed at all. By the term "recombinant nucleic acid" herein is meant nucleic acid, 
originally formed in vitro, in general, by the manipulation of nucleic acid, e.g., using 
polymerases and endonucleases, in a form not normally found in nature. In this manner, 
operably linkage of different sequences is achieved Thus an isolated nucleic acid, in a linear 
5 form, or an expression vector formed in vitro by ligating DNA molecules that are not 
normally joined, are both considered recombinant for the purposes of this invention. It is 
understood that once a recombinant nucleic acid is made and reintroduced into a host cell or 
organism, it will replicate non-recombinantly, i.e,, using the in vivo cellular machinery of the 
host cell rather than in vitro manipulations; however, such nucleic acids, once produced 

10 recombinantly, although subsequently replicated non-recombinantly, are still considered 

recombinant for the purposes of the invention. Similarly, a "recombinant protein" is a protein 
made using recombinant techniques, i.e., through the expression of a recombinant nucleic 
acid as depicted above. 

The term "heterologous" when used with reference to portions of a nucleic 

15 acid indicates that the nucleic acid comprises two or more subsequences that are not normally 
found in the same relationship to each other in nature. For instance, the nucleic acid is 
typically recombinantly produced, having two or more sequences, e.g., from unrelated genes 
arranged to make a new functional nucleic acid, e.g., a promoter from one source and a 
coding region from another source. Similarly, a heterologous protein will often refer to two 

20 or more subsequences that are not found in the same relationship to each other in nature (e.g., 
a fusion protein). 

A "promoter" is defined as an array of nucleic acid control sequences that 
direct transcription of a nucleic acid As used herein, a promote: includes necessary nucleic 
acid sequences near the start site of transcription, such as, in the case of a polymerase II type 

25 promoter, a TATA element A promoter also optionally includes distal enhancer or repressor 
elements, which can be located as much as several thousand base pairs from the start site of 
transcription. A "constitutive" promoter is a promoter that is active under most 
environmental and developmental conditions. An "inducible" promoter is a promoter that is 
active under environmental or developmental regulation. The term "operably linked" refers 

30 to a functional linkage between a nucleic acid expression control sequence (such as a 
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promoter, or array of transcription factor binding sites) and a second nucleic acid sequence, 
wherein the expression control sequence directs transcription of the nucleic acid 
corresponding to the second sequence. 

An "expression vector" is a nucleic acid construct, generated recombinantly or 

5 synthetically, with a series of specified nucleic acid elements that permit transcription of a 
particular nucleic acid in a host celL The expression vector can be part of a plasmid, virus, or 
nucleic acid fragment Typically, the expression vector includes a nucleic acid to be 
transcribed operably linked to a promoter. 

The phrase "selectively (or specifically) hybridizes to" refers to the binding, 

1 0 duplexing, or hybridizing of a molecule only to a particular nucleotide sequence under 

stringent hybridization conditions when that sequence is present in a complex mixture (e.g., 
total cellular or library DNA or RNA). 

The phrase "stringent hybridization conditions" refers to conditions under 
which a probe will hybridize to its target subsequence, typically in a complex mixture of 

1 5 nucleic acids, but to no other sequences. Stringent conditions are sequence-<iependent and 
will be different in different circumstances. Longer sequences hybridize specifically at 
higher temperatures. An extensive guide to the hybridization of nucleic acids is found in 
Tijssen, Techniques in Biochemistry and Molecular Biology-Hybridization with Nucleic 
Probes, "Overview of principles of hybridization and the strategy of nucleic acid assays" 

20 (1993). Generally, stringent conditions are selected to be about 5-10°C lower than the 

thermal melting point (Tm) for the specific sequence at a defined ionic strength pH. The T m is 
the temperature (under defined ionic strength, pH, and nucleic concentration) at which 50% 
of the probes complementary to the target hybridize to the target sequence at equilibrium (as 
the target sequences are present in excess, at Tm, 50% of the probes are occupied at 

25 equilibrium). Stringent conditions will be those in which the salt concentration is less than 
about 1 .0 M sodium ion, typically about 0.01 to 1 .0 M sodium ion concentration (or other 
salts) at pH 7.0 to 8.3 and the temperature is at least about 30°C for short probes (e.g., 10 to 
50 nucleotides) and at least about 60°C for long probes (e.g., greater than 50 nucleotides). 
Stringent conditions may also be achieved with the addition of destabilizing agents such as 

30 formamide. For selective or specific hybridization, a positive signal is at least two times 
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background, preferably 10 times background hybridization. Exemplary stringent 
hybridization conditions can be as following: 50% formamide, 5x SSC, and 1% SDS, 
incubating at 42°C, or, 5x SSC, 1% SDS, incubating at 65°C, with wash in 0.2x SSC, and 
0.1% SDS at 65°C. For PCR, a temperature of about 36°C is typical for low stringency 
5 amplification, although annealing temperatures may vary between about 32°C and 48°C 
depending on prima- length. For high stringency PCR amplification, a temperature of about 
62°C is typical, although high stringency annealing temperatures can range from about 50°C 
to about 65°C, depending on the primer length and specificity. Typical cycle conditions for 
both high and low stringency amplifications include a denaturation phase of 90°C - 95°C for 

10 30 sec - 2 min., an annealing phase lasting 30 sec. - 2 min., and an extension phase of about 
72°C for 1 - 2 min. Protocols and guidelines for low and high stringency amplification 
reactions are provided, e.g., in Innis et al (1990) PCR Protocols, A Guide to Methods and 
Applications, Academic Press, Inc. N.Y.). 

Nucleic acids that do not hybridize to each other under stringent conditions are 

15 still substantially identical if the polypeptides which they encode are substantially identical. 
This occurs, e.g., when a copy of a nucleic acid is created using the maximum codon 
degeneracy permitted by the genetic code. In such cases, the nucleic acids typically hybridize 
under moderately stringent hybridization conditions. Exemplary "moderately stringent 
hybridization conditions*' include a hybridization in a buffer of 40% formamide, 1 M NaCl, 

20 1% SDS at 37°C, and a wash in IX SSC at 45°C. A positive hybridization is at least twice 
background. Those of ordinary skill will readily recognize that alternative hybridization and 
wash conditions can be utilized to provide conditions of similar stringency. Additional 
guidelines for detennining hybridization parameters are provided in numerous reference, e.g., 
and Current Protocols in Molecular Biology, e& Ausubel, et al 

25 The phrase 'functional effects'* in the context of assays for testing compounds 

that modulate activity of a breast cancer protein includes the determination of a parameter 
that is indirectly or directly under the influence of the breast cancer protein or nucleic acid, 
e.g., a functional, physical, or chemical effect, such as the ability to decrease breast cancer. It 
includes ligand binding activity, cell growth on soft agar; anchorage dependence; contact 
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inhibition and density limitation of growth; cellular proliferation; cellular transformation; 
growth factor or serum dependence; tumor specific marker levels; invasiveness into Matrigel; 
tumor growth and metastasis in vivo; mRNA and protein expression in cells undergoing 
metastasis, and other characteristics of breast cancer cells. "Functional effects" include in 

5 vitro, in vivo, and ex vivo activities. 

By "determining the functional effect" * is meant assaying for a compound that 
increases or decreases a parameter that is indirectly or directly under the influence of a breast 
cancer protein sequence, e.g., functional, enzymatic, physical and chemical effects. Such 
functional effects can be measured by any means known to those skilled in the art, e.g., 

10 changes in spectroscopic characteristics (e.g., fluorescence, absorbance, refractive index), 
hydrodynamic (e.g., shape), chromatographic, or solubility properties for the protein, 
measuring inducible markers or transcriptional activation of the breast cancer protein; 
measuring binding activity or binding assays, e.g. binding to antibodies or other ligands, and 
measuring cellular proliferation. Determination of the functional effect of a compound on 

15 breast cancer can also be performed using breast cancer assays known to those of skill in the 
art such as an in vitro assays, e.g., cell growth on soft agar, anchorage dependence; contact 
inhibition and density limitation of growth; cellular proliferation; cellular transformation; 
growth factor or serum dependence; tumor specific marker levels; invasiveness into Matrigel; 
tumor growth and metastasis in vivo; mRNA and protein expression in cells undergoing 

20 metastasis, and other characteristics of breast cancer cells. The functional effects can be 

evaluated by many means known to those skilled in the art, e.g., microscopy for quantitative 
or qualitative measures of alterations in morphological features, measurement of changes in 
RNA or protein levels for breast cancer-associated sequences, measurement ofRNA stability, 
identification of downstream or reporter gene expression (CAT, luciferase, p-gal, GFP and 

25 the like), e.g., via chemiluminescence, fluorescence, colorimetric reactions, antibody binding, 
inducible markers, and ligand binding assays. 

'Inhibitors", "activators", and "modulators" of breast cancer polynucleotide 
and polypeptide sequences are used to refer to activating, inhibitory, or modulating molecules 
or compounds identified using in vitro and in vivo assays of breast cancer polynucleotide and 

30 polypeptide sequences. Inhibitors are compounds that, e.g., bind to, partially or totally block 
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activity, decrease, prevent, delay activation, inactivate, desensitize, or down regulate the 
activity or expression of breast cancer proteins, e.g., antagonists. Antisense nucleic acids 
may seem to inhibit expression and subsequent function of the protein. "Activators" are 
compounds that increase, open, activate, facilitate, enhance activation, sensitize, agonize, or 

5 up regulate breast cancer protein activity. Inhibitors, activators, or modulators also include 
genetically modified versions of breast cancer proteins, e.g., versions with altered activity, as 
well as naturally occurring and synthetic ligands, antagonists, agonists, antibodies, small 
chemical molecules and the like. Such assays for inhibitors and activators include, e.g., 
expressing the breast cancer protein in vitro, in cells, or cell membranes, applying putative 

10 modulator compounds, and then determining the functional effects on activity, as described 
above. Activators and inhibitors of breast cancer can also be identified by incubating breast 
cancer cells with the test compound and determining increases or decreases in the expression 
of 1 or more breast cancer proteins, e.g., 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 40, 50 or more breast 
cancer proteins, such as breast cancer proteins encoded by the sequences set out in Tables 1- 

15 25. 

Samples or assays comprising breast cancer proteins that are treated with a 
potential activator, inhibitor, or modulator are compared to control samples without the 
inhibitor, activator, or modulator to examine the extent of inhibition. Control samples 
(untreated with inhibitors) are assigned a relative protein activity value of 100%. Inhibition 

20 of a polypeptide is achieved when the activity value relative to the control is about 80%, 
preferably 50%, more preferably 25-0%. Activation of a breast cancer polypeptide is 
achieved when the activity value relative to the control (untreated with activators) is 1 10%, 
more preferably 150%, more preferably 200-500% (i.e., two to five fold higher relative to the 
control), more preferably 1000-3000% higher. 

25 The phrase "changes in cell growth" refers to any change in cell growth and 

proliferation characteristics in vitro or in vivo> such as formation of foci, anchorage 
independence, semi-solid or soft agar growth, changes in contact inhibition and density 
limitation of growth, loss of growth factor or serum requirements, changes in cell 
morphology, gaining or losing immortalization, gaining or losing tumor specific markers, 

30 ability to form or suppress tumors when injected into suitable animal hosts, and/or 
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immortalization of the cell. See, e.g., Fresbney, Culture of Animal Cells a Manual of Basic 
Technique pp. 231-241 (3 rf ed 1994). 

'Tumor cell" refers to precancerous, cancerous, and normal cells in a tumor. 
"Cancer cells," ''transformed" cells or 'transformation" in tissue culture, refers 
5 to spontaneous or induced phenotypic changes that do not necessarily involve the uptake of 
new genetic material. Although transformation can arise from infection with a transforming 
virus and incorporation of new genomic DNA, or uptake of exogenous DNA, it can also arise 
spontaneously or following exposure to a carcinogen, thereby mutating an endogenous gene. 
Transformation is associated with phenotypic changes, such as immortalization of cells, 

10 aberrant growth control, nonmorphological changes, and/or malignancy (see, Freshney, 
Culture of Animal Cells a Manual of Basic Technique (3 rd ed. 1994)). 

"Antibody" refers to a polypeptide comprising a framework region from an 
immunoglobulin gene or fragments thereof that specifically binds and recognizes an antigen. 
The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta, 

1 5 epsilon, and mu constant region genes, as well as the myriad immunoglobulin variable region 
genes. Light chains are classified as either kappa or lambda. Heavy chains are classified as 
gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG, 
IgM, IgA, IgP and IgE, respectively. Typically, the antigen-binding region of an antibody or 
its functional equivalent will be most critical in specificity and affinity of binding. See Paul, 

20 Fundamental Immunology. 

An exemplary immunoglobulin (antibody) structural unit comprises a 
tetramer. Each tetramer is composed of two identical pairs of polypeptide chains, each pair 
having one "light" (about 25 fcD) and one Tieavy" chain (about 50-70 kD). The N-terminus 
of each chain defines a variable region of about 100 to 1 10 or more amino acids primarily 

25 responsible for antigen recognition. The terms variable light chain (V L ) and variable heavy 
chain (Vh) refer to these light and heavy chains respectively. 

Antibodies exist, e.g., as intact immunoglobulins or as a number of well- 
characterized fragments produced by digestion with various peptidases. Thus, e.g., pepsin 
digests an antibody below the disulfide linkages in the hinge region to produce F(ab)*2, a 

30 dimer of Fab which itself is a light chain joined to V h -Ch1 by a disulfide bond. The F(ab)' 2 
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may be reduced under mild conditions to break the disulfide linkage in the hinge region, 
thereby converting the F(ab)* 2 dimer into an Fab' monomer. The Fab* monomer is 
essentially Fab with part of the hinge region (see Fundamental Immunology (Paul ed., 3d ed. 
1993). While various antibody fragments are defined in terms of the digestion of an intact 
5 antibody, one of skill will appreciate that such fragments may be synthesized de novo either 
chemically or by using recombinant DNA methodology. Thus, the term antibody, as used 
herein, also includes antibody fragments either produced by the modification of whole 
antibodies, or those synthesized de novo using recombinant DNA methodologies (e.g., single 
chain Fv) or those identified using phage display libraries (see, e.g, McCafferty et aL, Nature 

10 348:552-554 (1990)) 

For preparation of antibodies, e.g., recombinant, monoclonal, or polyclonal 
antibodies, many technique known in the art can be used (see, e.g., Kohler & Milstein, 
Nature 256:495-497 (1975); Kozbor et al, Immunology Today 4:72 (1983); Cole et aL, pp. 
77-96 in Monoclonal Antibodies and Cancer Therapy (1985); Coligan, Current Protocols in 

15 Immunology (1991); Harlow & Lane, Antibodies, A Laboratory Manual (1988); and Goding, 
Monoclonal Antibodies: Principles and Practice (2d ed. 1986)). Techniques for the 
production of single chain antibodies (U.S. Patent 4,946,778) can be adapted to produce 
antibodies to polypeptides of this invention. Also, transgenic mice, or other organisms such 
as other mammals, may be used to express humanized antibodies. Alternatively, phage 

20 display technology can be used to identify antibodies and heteromeric Fab fragments that 
specifically bind to selected antigens (see, e.g, McCafferty et al 9 Nature 348:552-554 
(1990); Marks et al y Biotechnology 10:779-783 (1992)). 

A "chimeric antibody** is an antibody molecule in which (a) the constant 
region, or a portion thereof is altered, replaced or exchanged so that the antigen binding site 

25 (variable region) is linked to a constant region of a different or altered class, effector function 
and/or species, or an entirely different molecule which confers new properties to the chimeric 
antibody, e.g. 9 an enzyme, toxin, hormone, growth factor, drug, etc.; or (b) the variable 
region, or a portion thereof; is altered, replaced or exchanged with a variable region having a 
different or altered antigen specificity. 

30 
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Identification of breast cancer-associated sequences 

In one aspect, the expression levels of genes are determined in different 
patient samples for which diagnosis information is desired, to provide expression profiles. 
An expression profile of a particular sample is essentially a "fingerprint" of the state of the 
5 sample; while two states may have any particular gene similarly expressed, the evaluation of 
a number of genes simultaneously allows the generation of a gene expression profile that is 
characteristic of the state of the cell. That is, normal tissue (e.g., normal breast or other 
tissue) may be distinguished from cancerous or metastatic cancerous tissue of the breast, or 
breast cancer tissue or metastatic breast cancerous tissue can be compared with tissue 
10 samples of breast and other tissues from surviving cancer patients. By comparing expression 
profiles of tissue in known different breast cancer states, information regarding which genes 
are important (including both up- and down-regulation of genes) in each of these states is 
obtained. 

The identification of sequences that are differentially expressed in breast 
15 cancer versus non-breast cancer tissue allows the use of this information in a number of ways. 
For example, a particular treatment regime may be evaluated: does a chemotherapeutic drug 
act to down-regulate breast cancer, and thus tumor growth or recurrence, in a particular 
patient Similarly, diagnosis and treatment outcomes may Jbe done or confirmed by 
comparing patient samples with the known expression profiles. Metastatic tissue can also be 
20 analyzed to determine the stage of breast cancer in the tissue. Furthermore, these gene 
expression profiles (or individual genes) allow screening of drug candidates with an eye to 
mimicking or altering a particular expression profile; e.g., screening can be done for drugs 
that suppress the breast cancer expression profile. This may be done by making biochips 
comprising sets of the important breast cancer genes, which can then be used in these screens. 
25 These methods can also be done on the protein basis; that is, protein expression levels of the 
breast cancer proteins can be evaluated for diagnostic purposes or to screen candidate agents. 
In addition, the breast cancer nucleic acid sequences can be administered for gene therapy 
purposes, including the administration of antisense nucleic acids, or the breast cancer proteins 
(including antibodies and other modulators thereof) administered as therapeutic drugs. 
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Thus the present invention provides nucleic acid and protein sequences that 
are differentially expressed in breast cancer, herein termed "breast cancer sequences." As 
outlined below, breast cancer sequences include those that are up-regulated (i.e., expressed at 
a higher level) in breast cancer, as well as those that are down-regulated (i.e., expressed at a 
5 lower level). In a preferred embodiment, the breast cancer sequences are from humans; 
however, as will be appreciated by those in the art, breast cancer sequences from other 
organisms may be useful in animal models of disease and drug evaluation; thus, other breast 
cancer sequences are provided, from vertebrates, including mammals, including rodents (rats, 
mice, hamsters, guinea pigs, etc.), primates, farm animals (including sheep, goats, pigs, cows, 

10 horses, etc.) and pets, e.g., (dogs, cats, etc.). Breast cancer sequences from other organisms 
may be obtained using the techniques outlined below. 

Breast cancer sequences can include both nucleic acid and amino acid 
sequences. As will be appreciated by those in the art and is more fully outlined below, breast 
cancer nucleic acid sequences are useful in a variety of applications, including diagnostic 

15 applications, which will detect naturally occurring nucleic acids, as well as screening 
applications; e.g., biochips comprising nucleic acid probes or PGR microtiter plates with 
selected probes to the breast cancer sequences can be generated 

A breast cancer sequence can be initially identified by substantial nucleic acid 
and/or amino acid sequence homology to the breast cancer sequences outlined herein Such 

20 homology can be based upon the overall nucleic acid or amino acid sequence, and is 

generally determined as outlined below, using either homology programs or hybridization 
conditions. 

For identifying breast cancer-associated sequences, the breast cancer screen 
typically includes comparing genes identified in different tissues, e.g., normal and cancerous 
25 tissues, or tumor tissue samples from patients who have metastatic disease vs. non metastatic 
tissue. Other suitable tissue comparisons include comparing breast cancer samples with 
metastatic cancer samples from other cancers, such as lung, breast, gastrointestinal cancers, 
ovarian, etc. Samples of different stages of breast cancer, e.g., survivor tissue, drug resistant 
states, and tissue undergoing metastasis, are applied to biochips comprising nucleic acid 
30 probes. The samples are first microdissected, if applicable, and treated as is known in the art 
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for the preparation ofmRNA. Suitable biochips are commercially available, e.g. from 
Afiymetrix. Gene expression profiles as described herein are generated and the data 
analyzed. 

In one embodiment, the genes showing changes in expression as between 

5 normal and disease states are compared to genes expressed in other normal tissues, preferably 
normal breast, but also including, and not limited to lung, heart, brain, liver, breast, kidney, 
muscle, colon, small intestine, large intestine, spleen, bone and placenta. In a preferred 
embodiment, those genes identified during the breast cancer screen that are expressed in any 
significant amount in other tissues are removed from the profile, although in some 

1 0 embodiments, this is not necessary. That is, when screening for drags, it is usually preferable 
that the target be disease specific, to minimize possible side effects. 

In a preferred embodiment, breast cancer sequences are those that are up- 
regulated in breast cancer, that is, the expression of these genes is higher in the breast cancer 
tissue as compared to non-cancerous tissue. <c Up-regulation" as used herein often means at 

1 5 least about a two-fold change, preferably at least about a three fold change, with at least about 
five-fold or higher being preferred. All unigene cluster identification numbers and accession 
numbers herein are for the GenBank sequence database and the sequences of the accession 
numbers are hereby expressly incorporated by reference. GenBank is known in the art, see, 
eg., Benson, DA, et aL> Nucleic Acids Research 26:1-7 (1998) and 

20 httpyAvwjicbi.nlnLnih.gov/. Sequences are also available in other databases, e.g., 

European Molecular Biology Laboratory (EMBL) and DNA Database of Japan pDBJ). 
U.S. Patent Application N. 09/687,576, with the same assignee as the present application, 
further discloses related sequences, compositions, and methods of diagnosis and treatment of 
breast cancer is hereby expressly incorporated by reference. 

25 In another preferred embodiment, breast cancer sequences are those that are 

down-regulated in the breast cancer; that is, the expression of these genes is lower in breast 
cancer tissue as compared to non-cancerous tissue (see, Tables 1,2, 3, 15, 16 etc.). 
'Down-regulation" as used herein often means at least about a two-fold change, preferably at 
least about a three fold change, with at least about five-fold or higher being preferred 
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Informatics 

The ability to identify genes that are over or under expressed in breast cancer 
can additionally provide high-resolution, high-sensitivity datasets which can be used in the 

5 areas of diagnostics, therapeutics, drug development, pharmacogenetics, protein structure, 
biosensor development, and other related areas. For example, the expression profiles can be 
used in diagnostic or prognostic evaluation of patients with breast cancer. Or as another 
example, subcellular toxicological information can be generated to better direct drug structure 
and activity correlation (see Anderson, Pharmaceutical Proteomics r Targets, Mechanism, 

10 and Function, paper presented at the BC Proteomics conference, Coronado, CA (June 1 1-12, 
1998)). Subcellular toxicological information can also be utilized in a biological sensor 
device to predict the likely toxicological effect of chemical exposures and likely tolerable 
exposure thresholds (see U.S. Patent No. 5,81 1,231). Similar advantages accrue from 
datasets relevant to other biomolecules and bioactive agents (e.g., nucleic acids, saccharides, 

1 5 lipids, drugs, and the like). 

Thus, in another embodiment, the present invention provides a database that 
includes at least one set of assay data. The data contained in the database is acquired, e.g., 
using array analysis either singly or in a library format. The database can be in substantially 
any form in which data can be maintained and transmitted, but is preferably an electronic 

20 database. The electronic database of the invention can be maintained on any electronic 
device allowing for the storage of and access to the database, such as a personal computer, 
but is preferably distributed on a wide area network, such as the World Wide Web. 

The focus of the present section on databases that include peptide sequence 
data is for clarity of illustration only. It will be apparent to those of skill in the art that similar 

25 databases can be assembled for any assay data acquired using an assay of the invention. 

The compositions and methods for identifying and/or quantitating the relative 
and/or absolute abundance of a variety of molecular and macromolecular species from a 
biological sample undergoing breast cancer, i.e., the identification of breast cancer-associated 
sequences described herein, provide an abundance of information, which can be correlated 

30 with pathological conditions, predisposition to disease, drug testing, therapeutic monitoring, 
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gene-disease causal linkages, identification of correlates of immunity and physiological 
status, among others. Although the data generated from the assays of the invention is suited 
for manual review and analysis, in a preferred embodiment, prior data processing using high- 
speed computers is utilized. 

5 An array of methods for indexing and retrieving biomolecular information is 

known in the art For example, U.S. Patents 6,023,659 and 5,966,712 disclose a relational 
database system for storing biomolecular sequence information in a manner that allows 
sequences to be catalogued and searched according to one or more protein function 
hierarchies. U.S. Patent 5,953,727 discloses a relational database having sequence records 

1 0 containing information in a format that allows a collection of partial-length DNA sequences 
to be catalogued and searched according to association with one or more sequencing projects 
for obtaining full-length sequences from the collection of partial length sequences. U.S. 
Patent 5,706,498 discloses a gene database retrieval system for making a retrieval of a gene 
sequence similar to a sequence data item in a gene database based on the degree of similarity 

1 5 between a key sequence and a target sequence. U.S. Patent 5,538,897 discloses a method 

using mass spectroscopy fragmentation patterns of peptides to identify amino acid sequences 
in computer databases by comparison of predicted mass spectra with experimentally-derived 
mass spectra using a closeness-of-fit measure. U.S. Patent 5,926,81 8 discloses a multi- 
dimensional database comprising a functionality for multi-dimensional data analysis 

20 described as on-line analytical processing (OLAP), which entails the consolidation of 

projected and actual data according to more than one consolidation path or dimension. U.S. 
Patent 5,295,261 reports a hybrid database structure in which the fields of each database 
record are divided into two classes, navigational and informational data, with navigational 
fields stored in a hierarchical topological map which can be viewed as a tree structure or as 

25 the merger of two or more such tree structures. 

See also Mount et al, Bioinformatics (2001); Biological Sequence Analysis: 
Probabilistic Models of Proteins and Nucleic Acids (Durbin et al, eds., 1999); 
Bioinformatics: A Practical Guide to the Analysis of Genes and Proteins (Baxevanis & 
Oeullette eds., 1998)); Rashidi & Buehler, Bioinformatics: Basic Applications in Biological 

30 Science and Medicine (1999); Introduction to Computational Molecular Biology (Setubal et 
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al. y eds 1997); Bioinformatics: Methods and Protocols (Misener & Krawetz, eds, 2000); 
Bioinformatics: Sequence, Structure, and Databanks: A Practical Approach (Higgins & 
Taylor, eds., 2000); Brown, Bioinformatics: A Biologist 's Guide to Biocomputing and the 
Internet (2001); Han & Kamber, Data Mining: Concepts and Techniques (2000); and 

5 Waterman, Introduction to Computational Biology: Maps, Sequences, and Genomes (1995). 

The present invention provides a computer database comprising a computer 
and software for storing in computer-retrievable form assay data records cross-tabulated, e.g., 
with data specifying the source of the target-containing sample from which each sequence 
specificity record was obtained. 

10 la an exemplary embodiment, at least one of the sources of target-containing 

sample is from a control tissue sample known to be free of pathological disorders. In a 
variation, at least one of the sources is a known pathological tissue specimen, e.g., a 
neoplastic lesion or another tissue specimen to be analyzed for breast cancer. In another 
variation, the assay records cross-tabulate one or more of the following parameters for each 

15 target species in a sample: (1) a unique identification code, which can include, e.g., a target 
molecular structure and/or characteristic separation coordinate (e.g., electrophoretic 
coordinates); (2) sample source; and (3) absolute and/or relative quantity of the target species 
present in the sample. / 

The invention also provides for the storage and retrieval of a collection of 

20 target data in a computer data storage apparatus, which ca^ include magnetic disks, optical 
disks, magneto-optical disks, DRAM, SRAM, SGRAM, SDRAM, RDRAM, DDR RAM, 
magnetic bubble memory devices, and other data storage devices, including CPU registers 
and on-CPU data storage arrays. Typically, the target data records are stored as a bit pattern 
in an array of magnetic domains on a magnetizable medium or as an array of charge states or 

25 transistor gate states, such as an array of cells in a DRAM device (e.g., each cell comprised of 
a transistor and a charge storage area, which maybe on the transistor). In one embodiment, 
the invention provides such storage devices, and computer systems built therewith, 
comprising a bit pattern encoding a protein expression fingerprint record comprising unique 
identifiers for at least 10 target data records cross-tabulated with target source. 
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When the target is a peptide or nucleic acid, the invention preferably provides 
a method for identifying related peptide or nucleic acid sequences, comprising performing a 
computerized comparison between a peptide or nucleic acid sequence assay record stored in 
or retrieved from a computer storage device or database and at least one other sequence. The 
5 comparison can include a sequence analysis or comparison algorithm or computer program 
embodiment thereof (e.g., FASTA, TFASTA, GAP, BESTFTT) and/or the comparison may 
be of the relative amount of a peptide or nucleic acid sequence in a pool of sequences 
determined from a polypeptide or nucleic acid sample of a specimen. 

The invention also preferably provides a magnetic disk, such as an IBM- 
10 compatible (DOS, Windows, Windows95/98/2000, Windows NT, OS/2) or other format 

(e.g., Linux, SunOS, Solaris, ADC, SCO Unix, VMS, MV, Macintosh, etc.) floppy diskette or 
hard (fixed, Winchester) disk drive, comprising a bit pattern encoding data from an assay of 
the invention in a file format suitable for retrieval and processing in a computerized sequence 
analysis, comparison, or relative quantitation method. 
1 5 The invention also provides a network, comprising a plurality of computing 

devices linked via a data Knlr, such as an Ethernet cable (coax or lOBaseT), telephone line, 
ISDN line, wireless network, optical fiber, or other suitable signal transmission medium, 
whereby at least one network device (e.g., computer, disk array, etc.) comprises a pattern of 
magnetic domains (e.g., magnetic disk) and/or charge domains (e.g., an array of DRAM 
20 cells) composing a bit pattern encoding data acquired from an assay of the invention. 

The invention also provides a method for transmitting assay data that includes 
generating an electronic signal on an electronic communications device, such as a modem, 
ISDN terminal adapter, DSL, cable modem, ATM switch, or the like, wherein the signal 
includes (in native or encrypted format) a bit pattern encoding data from an assay or a 
25 database comprising a plurality of assay results obtained by the method of the invention. 

In a preferred embodiment, the invention provides a computer system for 
comparing a query target to a database containing an array of data structures, such as an assay 
result obtained by the method of the invention, and ranking database targets based on the 
* degree of identity and gap weight to the target data. A central processor is preferably 
30 initialized to load and execute the computer program for alignment and/or comparison of the 
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assay results. Data for a query target is entered into the central processor via an I/O device. 
Execution of the computer program results in the central processor retrieving the assay data 
from the data file, which comprises a binary description of an assay result 

The target data or record and the computer program can be transferred to 

5 secondary memory, which is typically random access memory (e.g., DRAM, SRAM, 
SGRAM, or SDRAM). Targets are ranked according to the degree of correspondence 
between a selected assay characteristic (e.g., binding to a selected affinity moiety) and the 
same characteristic of the query target and results are output via an I/O device. For example, 
a central processor can be a conventional computer (e.g., Intel Pentium, PowerPC, Alpha, 

10 PA-8000, SPARC, MIPS 4400, MIPS 10000, VAX, etc.); a program can be a commercial or 
public domain molecular biology software package (e.g., UWGCG Sequence Analysis 
Software, Darwin); a data file can be an optical or magnetic disk, a data server, a memory 
device (e.g., DRAM, SRAM, SGRAM, SDRAM, EPROM, bubble memory, flash memory, 
etc.); an I/O device can be a terminal comprising a video display and a keyboard, a modem, 

15 an ISDN terminal adapter, an Ethernet port, a punched card reader, a magnetic strip reader, or 
other suitable I/O device. 

The invention also preferably provides the use of a computer system, such as 
that described above, which comprises: (1) a computer, (2) a stored bit pattern encoding a 
collection of peptide sequence specificity records obtained by the methods of the invention, 

20 which may be stored in the computer, (3) a comparison target, such as a query target; and (4) 
a program for alignment and comparison, typically with rank-ordering of comparison results 
on the basis of computed similarity values. 

Characteristics of breast cancer-associated proteins 

25 Breast cancer proteins of the present invention may be classified as secreted 

proteins, transmembrane proteins or intracellular proteins. In one embodiment, the breast 
cancer protein is an intracellular protein. Intracellular proteins may be found in the 
cytoplasm and/or in the nucleus. Intracellular proteins are involved in all aspects of cellular 
function and replication (including, e.g., signaling pathways); aberrant expression of such 

30 proteins often results in unregulated or disregulated cellular processes (see, eg., Molecular 
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Biology of the Cell (Alberts, e&, 3rd ed., 1994). For example, many intracellular proteins 
have enzymatic activity such as protein kinase activity, protein phosphatase activity, protease 
activity, nucleotide cyclase activity, polymerase activity and the like. Intracellular proteins 
also serve as docking proteins that are involved in organizing complexes of proteins, or 
5 targeting proteins to various subcellular localizations, and are involved in maintaining the 
structural integrity of organelles. 

An increasingly appreciated concept in characterizing proteins is the presence 
in the proteins of one or more motifs for which defined functions have been attributed. In 
addition to the highly conserved sequences found in the enzymatic domain of proteins, highly 

1 0 conserved sequences have been identified in proteins that are involved in protein-protein 
interaction. For example, Src-homology-2 (SH2) domains bind tyrosine-phosphorylated 
targets in a sequence dependent maimer. PTB domains, which are distinct from SH2 
domains, also bind tyrosine phosphorylated targets. SID domains bind to proline-rich 
targets. In addition, PH domains, tetratricopeptide repeats and WD domains to name only a 

1 5 few, have been shown to mediate protein-protein interactions. Some of these may also be 
involved in binding to phospholipids or other second messengers. As will be appreciated by 
one of ordinary skill in the art, these motifs can be identified on the basis of primary 
sequence; thus, an analysis of the sequence of proteins may provide insight into both the 
enzymatic potential of the molecule and/or molecules with which the protein may associate. 

20 One useful database is Pfam (protein families), which is a large collection of multiple 

sequence alignments and hidden Markov models covering many common protein domains. 
Versions are available via the internet from Washington University in St. Louis, the Sanger 
Center in England, and the Karolinska Institute in Sweden {see, eg., Bateman et al, Nuc 
Acids Res. 28:263-266 (2000); Sonnhammer et aL, Proteins 28:405-420 (1997); Bateman et 

25 a/., Nuc Acids Res. 27:260-262 (1 999); and Sonnhammer et al, Nuc Acids Res. 26:320-322- 
(1998)). 

In another embodiment, the breast cancer sequences are transmembrane 
. proteins. Transmembrane proteins arc molecules that span a phospholipid bilayer of a cell. 
They may have an intracellular domain, an extracellular domain, or both. The intracellular 
30 domains of such proteins may have a number of functions including those already described 
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for intracellular proteins. For example, the intracellular domain may have enzymatic activity 
and/or may serve as a binding site for additional proteins. Frequently the intracellular 
domain of transmembrane proteins serves both roles. For example certain receptor tyrosine 
kinases have both protein kinase activity and SH2 domains. In addition, autophosphorylation 
5 of tyrosines on the receptor molecule itself, creates binding sites for additional SHE domain 
containing proteins. 

Transmembrane proteins may contain from one to many transmembrane 
domains. For example, receptor tyrosine kinases, certain cytokine receptors, receptor 
guanylyl cyclases and receptor serine/threonine protein kinases contain a single 

10 transmembrane domain. However, various other proteins including channels and adenylyl 
cyclases contain numerous transmembrane domains. Many important cell surface receptors 
such as G protein coupled receptors (GPCRs) are classified as "seven transmembrane 
domain" proteins, as they contain 7 membrane spanning regions. Characteristics of 
transmembrane domains include approximately 20 consecutive hydrophobic amino acids that 

15 may be followed by charged amino acids. Therefore, upon analysis of the amino acid 
sequence of a particular protein, the localization and number of transmembrane domains 
within the protein may be predicted (see, e.g. PSORT web site http://psortnibb.ac.jp/). 
Important transmembrane protein receptors include, but are not limited to the insulin 
receptor, insulin-like growth factor receptor, human growth hormone receptor, glucose 

20 transporters, transferrin receptor, epidermal growth factor receptor, low density lipoprotein 
receptor, epidermal growth factor receptor, leptin receptor, interleukin receptors, e.g. DL-1 
receptor, IL-2 receptor, 

The extracellular domains of transmembrane proteins are diverse; however, 
conserved motifs are found repeatedly among various extracellular domains. Conserved 

25 structure and/or functions have been ascribed to different extracellular motifs. Many 

extracellular domains are involved in binding to other molecules. In one aspect, extracellular 
domains are found on receptors. Factors that bind the receptor domain include circulating 
ligands, which may be peptides, proteins, or small molecules such as adenosine and the like. 
For example, growth factors such as EGF, FGF and PDGF are circulating growth factors that 

30 bind to their cognate receptors to initiate a variety of cellular responses. Other factors include 
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cytokines, autogenic factors, neurotrophic factors and the like. Extracellular domains also 
bind to cell-associated molecules. In this respect, they mediate cell-cell interactions. Cell- 
associated ligands can be tethered to the cell, e.g., via a glycosylphosphatidylinositol (GPI) 
anchor, or may themselves be transmembrane proteins. Extracellular domains also associate 

5 with the extracellular matrix and contribute to the maintenance of the cell structure. 

Breast cancer proteins that are transmembrane are particularly preferred in the 
present invention as they are readily accessible targets for immunotherapeutics, as are 
described herein. In addition, as outlined below, transmembrane proteins can be also useful 
in imaging modalities. Antibodies may be used to label such readily accessible proteins in 

10 situ. Alternatively, antibodies can also label intracellular proteins, in which case samples are 
typically permeablized to provide access to intracellular proteins. 

It will also be appreciated by tho se in the art that a transmembrane protein can 
be made soluble by removing transmembrane sequences, e.g., through recombinant methods. 
Furthermore, transmembrane proteins that have been made soluble can be made to be 

1 5 secreted through recombinant means by adding an appropriate signal sequence. 

In another embodiment, the breast cancer proteins are secreted proteins; the 
secretion of which can be either constitutive or regulated. These proteins have a signal 
peptide or signal sequence that targets the molecule to the secretory pathway. Secreted 
proteins are involved in numerous physiological events; by virtue of their circulating nature, 

20 they serve to transmit signals to various other cell types. Tjhe secreted protein may function in 
an autocrine manner (acting on the cell that secreted the factor), a paracrine manner (acting 
on cells in close proximity to the cell that secreted the factor) or an endocrine manner (acting 
on cells at a distance). Thus secreted molecules find use in modulating or altering numerous 
aspects of physiology. Breast cancer proteins that are secreted proteins are particularly 

25 preferred in the present invention as they serve as good targets for diagnostic markers, e.g., 
for blood, plasma, serum, or stool tests. 

Use of breast cancer nucleic acids 

As described above, breast cancer sequence is initially identified by 
30 substantial nucleic acid and/or amino acrid sequence homology or linkage to the breast cancer 
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sequences outlined herein. Such homology can be based upon the overall nucleic acid or 
amino acid sequence, and is generally determined as outlined below, using either homology 
programs or hybridization conditions. Typically, linked sequences on a mRNA are found on 
the same molecule. 

5 The breast cancer nucleic acid sequences of the invention, e.g., the sequences 

in Tables 1-25, can be fragments of larger genes, i.e., they are nucleic acid segments. 
"Genes" in this context includes coding regions, non-coding regions, and mixtures of coding 
and non-coding regions. Accordingly, as will be appreciated by those in the art, using the 
sequences provided herein, extended sequences, in either direction, of the breast cancer genes 

10 can be obtained, using techniques well known in the art for cloning either longer sequences or 
the full length sequences; see Ausubel, et al 9 supra. Much can be done by informatics and 
many sequences can be clustered to include multiple sequences corresponding to a single 
gene, e.g., systems such as UniGene (see, http^Avww.ncbi.nhn.nih.govAJniGene/)- 

Qnce the breast cancer nucleic acid is identified, it can be cloned and, if 

1 5 necessary, its constituent parts recombined to form the entire breast cancer nucleic acid 
coding regions or the entire mRNA sequence. Once isolated from its natural source, e.g., 
contained within a plasmid or other vector or excised therefrom as a linear nucleic acid 
segment, the recombinant breast cancer nucleic acid can be further-used as a probe to identify 
and isolate other breast cancer nucleic acids, e.g., extended coding regions. It can also be 

20 used as a "precursor" nucleic acid to make modified or variant breast cancer nucleic acids 
and proteins. 

The breast cancer nucleic acids of the present invention are used in several 
ways. In a first embodiment, nucleic acid probes to the breast cancer nucleic acids are made 
and attached to biochips to be used in screening and diagnostic methods, as outlined below, 
25 or for administration, e.g., for gene therapy, vaccine, and/or antisense applications. 

Alternatively, the breast cancer nucleic acids that include coding regions of breast cancer 
proteins can be put into expression vectors for the expression of breast cancer proteins, again 
for screening purposes or for administration to a patient 

In a preferred embodiment, nucleic acid probes to breast cancer nucleic acids 
30 (both the nucleic acid sequences outlined in the figures and/or the complements thereof) are 
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made. The nucleic acid probes attached to the biochip are designed to be substantially 
complementary to the breast cancer nucleic acids, i.e. the target sequence (either the target 
sequence of the sample or to other probe sequences, e.g., in sandwich assays), such that 
hybridization of the target sequence and the probes of the present invention occurs. As 
5 outlined below, this complementarity need not be perfect; there may be any number of base 
pair mismatches which will interfere with hybridization between the target sequence and the 
single stranded nucleic acids of the present invention. However, if the number of mutations 
is so great that no hybridization can occur under even the least stringent of hybridization 
conditions, the sequence is not a complementary target sequence. Thus, by "substantially 

1 0 complementary" herein is meant that the probes are sufficiently complementary to the target 
sequences to hybridize under normal reaction conditions, particularly high stringency 
conditions, as outlined herein. 

A nucleic acid probe is generally single stranded but can be partially single 
and partially double stranded. The strandedness of the probe is dictated by the structure, 

1 5 composition, and properties of the target sequence. In general, the nucleic acid probes range 
from about 8 to about 100 bases long, with from about 10 to about 80 bases being preferred, 
and from about 30 to about 50 bases being particularly preferred. That is, generally whole 
genes are not used In some embodiments, much longer nucleic acids can be used, up to 
hundreds of bases. 

20 In a preferred embodiment, more than one probe per sequence is used, with 

either overlapping probes or probes to different sections of the target being used. That is, 
two, three, four or more probes, with three being preferred, are used to build in a redundancy 
for a particular target The probes can be overlapping (i.e., have some sequence in common), 
or separate. In some cases, PCR primers may be used to amplify signal for higher sensitivity. 
25 As will be appreciated by those in the art, nucleic acids can be attached or 

immobilized to a solid support in a wide variety of ways. By "immobilized" and grammatical 
equivalents herein is meant the association or binding between the nucleic acid probe and the 
solid support is sufficient to be stable under the conditions of binding, washing, analysis, and 
removal as outlined below. The binding can typically be covalent or non-covalent By "non- 
30 covalent binding" and grammatical equivalents herein is meant one or more of electrostatic, 
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hydrophilic, and hydrophobic interactions. Included in non-covalent binding is the covalent 
attachment of a molecule, such as, streptavidin to the support and the non-covalent binding of 
the biotinylated probe to the streptavidin. By "covalent binding" and grammatical 
equivalents herein is meant that the two moieties, the solid support and the probe, are 

5 attached by at least one bond, including sigma bonds, pi bonds and coordination bonds. 
Covalent bonds can be formed directly between the probe and the solid support or can be 
formed by a cross linker or by inclusion of a specific reactive group on either the solid 
support or the probe or both molecules. Immobilization may also involve a combination of 
covalent and non-covalent interactions. 

10 In general, the probes are attached to the biochip in a wide variety of ways, as 

will be appreciated by those in the art As described herein, the nucleic acids can either be 
synthesized first, with subsequent attachment to the biochip, or can be directly synthesized on 
the biochip. 

The biochip comprises a suitable solid substrate. By "substrate" or "solid 
15 support" or other grammatical equivalents herein is meant a material that can be modified to 
contain discrete individual sites appropriate for the attachment or association of the nucleic 
acrid probes and is amenable to at least one detection method. As will be appreciated by those 
in the art, the number of possible substrates are very large, and include, but are not limited to, 
glass and modified or functionalized glass, plastics (including acrylics, polystyrene and 
20 copolymers of styrene and other materials, polypropylene, polyethylene, polybutylene, 

polyurethanes, TeflonJ, etc.), polysaccharides, nylon or nitrocellulose, resins, silica or silica- 
based materials including silicon and modified silicon, carbon, metals, inorganic glasses, 
plastics, etc. In general, the substrates allow optical detection and do not appreciably 
fluoresce. A preferred substrate is described in copending application entitled Reusable Low 
25 Fluorescent Plastic Biochip, U.S. Application Serial No. 09/270,214, filed March 15, 1999, 
herein incorporated by reference in its entirety. 

Generally the substrate is planar, although as will be appreciated by those in 
the art, other configurations of substrates may be used as well. For example, the probes may 
be placed on the inside surface of a tube, for flow-through sample analysis to minimize 
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sample volume. Similarly, the substrate may be flexible, such as a flexible foam, including 
closed cell foams made of particular plastics. 

In a preferred embodiment, the surface of the biochip and the probe may be 
derivatized with chemical functional groups for subsequent attachment of the two. Thus, e.g., 

5 the biochip is derivatized with a chemical functional group including, but not limited to,' 
amino groups, carboxy groups, oxo groups and thiol groups, with amino groups being 
particularly preferred. Using these functional groups, the probes can be attached using 
functional groups on the probes. For example, nucleic acids containing amino groups can be 
attached to surfaces comprising amino groups, e.g. using linkers as are known in the art; e.g., 

10 homo-or hetero-bifunctional linkers as are well known (see 1994 Pierce Chemical Company 
catalog, technical section on cross-linkers, pages 155-200). In addition, in some cases, 
additional linkers, such as alkyi groups (including substituted and heteroalkyl groups) may be 
used. 

In this embodiment, oligonucleotides are synthesized as is known in the art, 
1 5 and then attached to the surface of the solid support. As will be appreciated by those skilled 
in the art, either the 5* or 3* terminus may be attached to the solid support, or attachment may 
be via an internal nucleoside. 

In another embodiment, the immobilization to the solid support may be very 
strong, yet non-covalent. For example, biotinylated oligonucleotides can be made, which 
20 bind to surfaces covalently coated with streptavidin, resulting in attachment. 

Alternatively, the oligonucleotides may be synthesized on the surface, as is 
known in the art. For example, photoactivation techniques utilizing photopolymerization 
compounds and techniques are used. In a preferred embodiment, the nucleic acids can be 
synthesized in situ, using well known photolithographic techniques, such as those described 
25 in WO 95/251 16; WO 95/35505; U.S. Patent Nos. 5,700,637 and 5,445,934; and references 
cited within, all of which are expressly incorporated by reference; these methods of 
attachment form the basis of the Affimetrix GeneChip™ technology. 

Often, amplification-based assays are performed to measure the expression 
level of breast cancer-associated sequences. These assays are typically performed in 
30 conjunction with reverse transcription. In such assays, a breast cancer-associated nucleic acid 
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sequence acts as a template in an amplification reaction (e.g., Polymerase Chain Reaction, or 
PCR). In a quantitative amplification, the amount of amplification product will be 
proportional to the amount of template in the original sample. Comparison to appropriate 
controls provides a measure of the amount of breast cancer-associated RNA. Methods of 

5 quantitative amplification are well known to those of skill in the art Detailed protocols for 
quantitative PCR are provided, e.g., in Innis et al, PCR Protocols, A Guide to Methods and 
Applications (1990). 

In some embodiments, a TaqMan based assay is used to measure expression. 
TaqMan based assays use a fluorogenic oligonucleotide probe that contains a 5' fluorescent 

10 dye and a 3* quenching agent. The probe hybridizes to a PCR product, but cannot itself be 
extended due to a blocking agent at the 3' end. When the PCR product is amplified in 
subsequent cycles, the 5' nuclease activity of the polymerase, e.g., AmpliTaq, results in the 
cleavage of the TaqMan probe. This cleavage separates the 5' fluorescent dye and the 3' 
quenching agent, thereby resulting in an increase in fluorescence as a function of 

15 amplification (see, e.g., literature provided by Peririn-Elmer, e.g. y www2.peririn-elmer.com). 

Other suitable amplification methods include, but are not limited to, ligase 
chain reaction (LCR) (see Wu & Wallace, Genomics 4:560 (1989), Landegren et al, Science 
241:1077 (1988), and Barringer et al, Gene 89:117 (1990)), transcription amplification 
(Kwohefa/., Proa Natl Acad, Set USA 86:1173 (1989)), self-sustained sequence replication 

20 (Guatelli et al, Proc. Nat Acad. Sci. USA 87:1874 (1990^, dot PCR, and linker adapter PGR, 
etc. 

Expression of breast cancer proteins from nucleic acids 

In a preferred embodiment, breast cancer nucleic acids, e.g., encoding breast 
25 cancer proteins are used to make a variety of expression vectors to express breast cancer 

proteins which can then be used in screening assays, as described below. Expression vectors 
and recombinant DNA technology are well known to those of skill in. the art (see, eg., 
Ausubel, supra, and Gene Expression Systems (Fernandez & Hoeffler, eds, 1999)) and are 
used to express proteins. The expression vectors may be either self-replicating 
30 extrachromosomal vectors or vectors which integrate into a host genome. Generally, these 

43 



WO 02/059377 



PCI7US02/02242 



expression vectors include transcriptional and translational regulatory nucleic acid operably 
linked to the nucleic acid encoding the breast cancer protein. The term "control sequences" 
refers to DNA sequences used for the expression of an operably linked coding sequence in a 
particular host organism. Control sequences that are suitable for prokaryotes, e.g., include a 

5 promoter, optionally an operator sequence, and a ribosome binding site. Eukaryotic cells are 
known to utilize promoters, polyadenylation signals, and enhancers. 

Nucleic acid is "operably linked" when it is placed into a functional 
relationship with another nucleic acid sequence. For example, DNA for a presequence or 
secretory leader is operably linked to DNA for a polypeptide if it is expressed as a preprotein 

10 that participates in the secretion of the polypeptide; a promoter or enhancer is operably linked 
to a coding sequence if it affects the transcription of the sequence; or a ribosome binding site 
is operably linked to a coding sequence if it is positioned so as to facilitate translation. 
Generally, "operably linked" means that the DNA sequences being linked are contiguous, 
and, in the case of a secretory leader, contiguous and in reading phase. However, enhancers 

15 do not have to be contiguous. Linking is typically accomplished by ligation at convenient 
restriction sites. If such sites do not exist, synthetic oligonucleotide adaptors or linkers are 
used in accordance with conventional practice. Transcriptional and translational regulatory 
nucleic acid will generally be appropriate to the host cell used to express the breast cancer 
protein. Numerous types of appropriate expression vectors, and suitable regulatory 

20 sequences are known in the art for a variety of host cells. 

In general, transcriptional and translational regulatory sequences may include, 
but are not limited to, promoter sequences, ribosomal binding sites, transcriptional start and 
stop sequences, translational start and stop sequences, and enhancer or activator sequences. 
In a preferred embodiment, the regulatory sequences include a promoter and transcriptional 

25 start and stop sequences. 

Promoter sequences encode either constitutive or inducible promoters. The 
promoters may be either naturally occurring promoters or hybrid promoters. Hybrid 
promoters, which combine elements of more than one promoter, are also known in the art, 
and are useful in the present invention. 
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In addition, an expression vector may comprise additional elements. For 
example, the expression vector may have two replication systems, thus allowing it to be 
maintained in two organisms, e.g. in mammalian or insect cells for expression and in a 
procaryotic host for cloning and amplification. Furthermore, for integrating expression 
5 vectors, the expression vector contains at least one sequence homologous to the host cell 
genome, and preferably two homologous sequences which flank die expression construct 
The integrating vector may be directed to a specific locus in the host cell by selecting the 
appropriate homologous sequence for inclusion in the vector. Constructs for integrating 
vectors are well known in the art (e.g., Fernandez & Hoeffler, supra). 
10 In addition, in a preferred embodiment, the expression vector contains a 

selectable marker gene to allow the selection of transformed host cells. Selection genes are 
well known in the art and will vary with the host cell used. 

The breast cancer proteins of the present invention are produced by culturing a 
host cell transformed with an expression vector containing nucleic acid encoding a breast 
1 5 cancer protein, under the appropriate conditions to induce or cause expression of the breast 
cancer protein. Conditions appropriate for breast cancer protein expression will vary with the 
choice of the expression vector and the host cell, and will be easily ascertained by one skilled 
in the art through routine experimentation or optimization. For example, the use of 
constitutive promoters in the expression vector will require optimizing the growth and 
20 proliferation of the host cell, while the use of an inducible promoter requires the appropriate 
growth conditions for induction. In addition, in some embodiments, the timing of the harvest 
is important For example, the baculoviral systems used in insect cell expression are lytic 
viruses, and thus harvest time selection can be crucial for product yield. 

Appropriate host cells include yeast, bacteria, archaebacteria, fungi, and insect 
25 and animal cells, including mammalian cells. Of particular interest are Saccharomyces 
cerevisiae and other yeasts, & coli, Bacillus subtilis, SB cells, C129 cells, 293 cells, 
Neurospora, BHK, CHO, COS, HeLa cells, HUVEC (human umbilical vein endothelial 
cells), THP1 cells (a macrophage cell line) and various other human cells and cell lines. 

In a preferred embodiment, the breast cancer proteins are expressed in 
30 mammalian cells. Mammalian expression systems are also known in the art, and include 
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retroviral and adenoviral systems. One expression vector system is a retroviral vector system 
such as is generally described in PCT/US97/01019 and PCT/US97/01048, both of which are 
hereby expressly incorporated by reference. Of particular use as mammalian promoters are 
the promoters from mammalian viral genes, since the viral genes are often highly expressed 
5 and have a broad host range. Examples include the SV40 early promoter, mouse mammary 
tumor virus LTR promoter, adenovirus major late promoter, herpes simplex virus promoter, 
and the CMV promoter (see, e.g., Fernandez & Hoeffler, supra). Typically, transcription 
termination and polyadenylation sequences recognized by mammalian cells are regulatory 
regions located 3* to the translation stop codon and thus, together with the promoter elements, 

10 flank the coding sequence. Examples of transcription terminator and polyadenlyation signals 
include those derived form SV40. 

The methods of introducing exogenous nucleic acid into mammalian hosts, as 
well as other hosts, is well known in the art, and will vary with the host cell used. 
Techniques include dextran-mediated transfection, calcium phosphate precipitation, 

1 5 polybrene mediated transfection, protoplast fusion, electroporation, viral infection, 

encapsulation of die polynucleotide^) in liposomes, and direct microinjection of the DNA 
into nuclei. 

In a preferred embodiment, breast cancer proteins are expressed in bacterial 
systems. Bacterial expression systems are well known in the art. Promoters from 

20 bacteriophage may also be used and are known in the art In addition, synthetic promoters 
and hybrid promoters are also useful; e.g., the tac promoter is a hybrid of the trp and lac 
promoter sequences. Furthermore, a bacterial promoter can include naturally occurring 
promoters of non-bacterial origin that have the ability to bind bacterial RNA polymerase and 
initiate transcription. In addition to a functioning promoter sequence, an efficient ribosome 

25 binding site is desirable. The expression vector may also include a signal peptide sequence 
that provides for secretion of the breast cancer protein in bacteria. The protein is either 
secreted into the growth media (gram-positive bacteria) or into the periplasmic space, located 
between the inner and outer membrane of the cell (gram-negative bacteria). The bacterial 
expression vector may also include a selectable marker gene to allow for the selection of 

30 bacterial strains that have been transformed. Suitable selection genes include genes which 
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reader the bacteria resistant to drugs such as ampicillin, chloramphenicol, erythromycin, 
kanamycin, neomycin and tetracycline. Selectable markers also include biosynthetic genes, 
such as those in the histidine, tryptophan and leucine biosynthetic pathways. These 
components are assembled into expression vectors. Expression vectors for bacteria are well 
5 known in the art, and include vectors for Bacillus subtilis, E. coli, Streptococcus cremoris, 
and Streptococcus lividans, among others (e.g., Fernandez & Hoeffler, supra). The bacterial 
expression vectors are transformed into bacterial host cells using techniques well known in 
the art, such as calcium chloride treatment, electroporation, and others. 

In one embodiment, breast cancer proteins are produced in insect cells. 

10 Expression vectors for the transformation of insect cells, and in particular, baculovirus-based 
expression vectors, are well known in the art 

In a preferred embodiment, breast cancer protein is produced in yeast cells. 
Yeast expression systems are well known in the art, and include expression vectors for 
Saccharomyces cerevisiae, Candida albicans and C. maltosa, Hansenula polymorpha, 

1 5 Kluyveromyces fragilis and K lactis, Pichia guillerimondii and P. pastoris, 
Schizosaccharomyces pombe, and Yarrowia lipotytica. 

The breast cancer protein may also be made as a fusion protein, using 
techniques well known in the art Thus, e.g., for the creation of monoclonal antibodies, if the 
desired epitope is small, the breast cancer protein may be fused to a carrier protein to form an 

20 immunogen. Alternatively, the breast cancer protein may be made as a fusion protein to 
increase expression, or for other reasons. For example, when the breast cancer protein is a 
breast cancer peptide, the nucleic acid encoding the peptide may be linked to other nucleic 
acid for expression purposes. 

In a preferred embodiment, the breast cancer protein is purified or isolated 

25 after expression. Breast cancer proteins may be isolated or purified in a variety of ways 
known to those skilled in the art depending on what other components are present in the 
sample. Standard purification methods include electrophoretic, molecular, immunological 
and chromatographic techniques, including ion exchange, hydrophobic, affinity, and reverse- 
phase HPLC chromatography, and chromatofocusing. For example, the breast cancer protein 

30 may be purified using a standard anti-breast cancer protein antibody column. Ultrafiltration 
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and diafiltration techniques, in conjunction with protein concentration, are also useful. For 
general guidance in suitable purification techniques, see Scopes, Protein Purification (1982). 
The degree of purification necessary will vary depending on the use of the breast cancer 
protein. In some instances no purification will be necessary. 
5 Once expressed and purified if necessary, the breast cancer proteins and 

nucleic acids are useful in a number of applications. They may be used as immunoselection 
reagents, as vaccine reagents, as screening agents, etc. 

Variants of breast cancer proteins 

10 In one embodiment, the breast cancer proteins are derivative or variant breast 

cancer proteins as compared to the wild-type sequence. That is, as outlined more fully below, 
the derivative breast cancer peptide will often contain at least one amino acid substitution, 
deletion or insertion, with amino acid substitutions being particularly preferred. The amino 
acid substitution, insertion or deletion may occur at any residue within the breast cancer 

15 peptide. 

Also included within one embodiment of breast cancer proteins of the present 
invention are amino acid sequence variants. These variants typically fall into one or more of 
three classes: substitutional, insertional or deletional variants. These variants ordinarily are 
prepared by site specific mutagenesis of nucleotides in the DNA encoding the breast cancer 

20 protein, using cassette orPCR mutagenesis or other techniques well known in the art, to 
produce DNA encoding the variant, and thereafter expressing the DNA in recombinant cell 
culture as outlined above. However, variant breast cancer protein fragments having up to 
about 100-150 residues may be prepared by in vitro synthesis using established techniques. 
Amino acid sequence variants are characterized by the predetermined nature of the variation, 

25 a feature that sets them apart from naturally occurring allelic or interspecies variation of the 
breast cancer protein amino acid sequence. The variants typically exhibit the same 
qualitative biological activity as the naturally occurring analogue, although variants can also 
be selected which have modified characteristics as will be more fully outlined below. 

While the site or region for introducing an amino acid sequence variation is 

30 predetermined, the mutation per se need not be predetermined. For example, in order to 
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optimize the performance of a mutation at a given site, random mutagenesis may be 
conducted at the target codon or region and the expressed breast cancer variants screened for 
the optimal combination of desired activity. Techniques for making substitution mutations at 
predetermined sites in DNA having a known sequence are well known, e.g., Ml 3 primer 

5 mutagenesis and PCR mutagenesis- Screening of the mutants is done using assays of breast 
cancer protein activities. 

Amino acid substitutions are typically of single residues; insertions usually 
will be on the order of from about 1 to 20 amino acids, although considerably larger 
insertions may be tolerated. Deletions range from about 1 to about 20 residues, although in 

10 some cases deletions may be much larger. 

Substitutions, deletions, insertions or any combination thereof may be used to 
arrive at a final derivative. Generally these changes are done on a few amino acids to 
minimize the alteration of the molecule. However, larger changes may be tolerated in certain 
circumstances. When small alterations in the characteristics of the breast cancer protein are 

1 5 desired, substitutions are generally made in accordance with the amino acid substitution 
relationships provided in the definition section. 

The variants typically exhibit the same qualitative biological activity and will 
elicit the same immune response as the naturally-occurring analog, although variants also are 
selected to modify the characteristics of the breast cancer proteins as needed Alternatively, 

20 the variant may be designed such that the biological activity of the breast cancer protein is 
altered. For example, glycosylation sites may be altered or removed- 
Substantial changes in function or immunological identity are made by 
selecting substitutions that are less conservative than those described above. For example, 
substitutions may be made which more significantly affect: the structure of the polypeptide 

25 backbone in the area of the alteration, for example the alpha-helical or beta-sheet structure; 
the chaige or hydrophobicity of the molecule at the target site; or the bulk of the side chain. 
The substitutions which in general are expected to produce die greatest changes in the 
polypeptide's properties are those in which (a) a hydrophilic residue, e.g. seryl or threonyl is 
substituted for (or by) a hydrophobic residue, e.g. leucyl, isoleucyl, phenylalanyl, valyl or 

30 alanyl; (b) a cysteine or proline is substituted for (or by) any other residue; (c) a residue 
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having an electropositive side chain, e.g. lysyl, arginyl, or histidyl, is substituted for (or by) 
an electronegative residue, e.g. glutamyl or aspartyl; or (d) a residue having a bulky side 
chain, e.g. phenylalanine, is substituted for (or by) one not having a side chain, e.g. glycine. 

Covalent modifications of breast cancer polypeptides are included within the 
5 scope of this invention. One type of covalent modification includes reacting targeted amino 
acid residues of a breast cancer polypeptide with an organic derivatizing agent that is capable 
of reacting with selected side chains or the N-or C-tenninal residues of a breast cancer 
polypeptide. Derivatization with bifimctional agents is useful, for instance, for crosslinking 
breast cancer polypeptides to a water-insoluble support matrix or surface for use in die 

10 method for purifying anti-breast cancer polypeptide antibodies or screening assays, as is more 
fully described below. Commonly used crosslinking agents include, e.g., 1,1- 
bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N-hydroxysuccinimide esters, e.g., esters 
with 4-azidosalicylic acid, homobifiinctional imidoesters, including disuccinimidyl esters 
such as 3,3*-dithiobis(succinimidy^)ropionate), bifimctional maleimides such as bis-N- 

15 maleimido-l,8-octane and agents such as methyl-3-((p-azidophenyl)didrio)propioimidate. . 

Other modifications include deamidation of glutaminyl and asparaginyl 
residues to the corresponding glutamyl and aspartyl residues, respectively, hydroxylation of 
proline and lysine, phosphorylation of hydroxyl groups of seryl, threonyl or tyrosyl residues, 
methylation of the amino groups of the lysine, arginine, and histidine side chains (Creighton, 

20 Proteins: Structure and Molecular Properties, pp. 79-86 (1983)), acetylation of die N- 
terminal amine, and amidation of any C-tenninal carboxyl group. 

Another type of covalent modification of the breast cancer polypeptide 
included within the scope of this invention comprises altering the native glycosylation pattern 
of the polypeptide. "Altering the native glycosylation pattern" is intended for purposes 

25 herein to mean deleting one or more carbohydrate moieties found in native sequence breast 
cancer polypeptide, and/or adding one or more glycosylation sites that are not present in the 
native sequence breast cancer polypeptide. Glycosylation patterns can be altered in many 
ways. For example the use of different cell types to express breast cancer-associated 
sequences can result in different glycosylation patterns. 
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Addition of glycosylation sites to breast cancer polypeptides may also be 
accomplished by altering the amino acid sequence thereof. The alteration may be made, e.g., 
by the addition o£ or substitution by, one or more serine or threonine residues to the native 
sequence breast cancer polypeptide (for O-linked glycosylation sites). The breast cancer 

5 amino acid sequence may optionally be altered through changes at the DNA level, 

particularly by mutating the DNA encoding the breast cancer polypeptide at preselected bases 
such that codons are generated that will translate into the desired amino acids. 

Another means of increasing the numbef of carbohydrate moieties on the 
breast cancer polypeptide is by chemical or enzymatic coupling of glycosides to the 

10 polypeptide. Such methods are described in the art, e.g., in WO 87/05330, and in Aplin & 
Wriston, CRC Crit. Rev. Biochem., pp. 259-306 (1981). 

Removal of carbohydrate moieties present on the breast cancer polypeptide 
may be accomplished chemically or enzymatically or by mutational substitution of codons 
encoding for amino acid residues that serve as targets for glycosylation. Chemical 

1 5 deglycosylation techniques are known in the art and described, for instance, by Hakimuddin, 
et aL 9 Arch. Biochem. Biophys., 259:52 (1987) and by Edge et aL 9 AnaL Biochem., 1 18:131 
(1 98 1). Enzymatic cleavage of carbohydrate moieties on polypeptides can be achieved by the 
use of a variety of endo-and exo-glycosidases as described by Thotakura et al, Meth. 
EnzymoL, 138:350(1987). 

20 Another type of covalent modification of breast cancer comprises linking the 

breast cancer polypeptide to one of a variety of nonproteinaceous polymers, e.g., 
polyethylene glycol, polypropylene glycol, or polyoxyalkylenes, in the manner set forth in 
U.S. Patent Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192 or 4,179,337. 

Breast cancer polypeptides of the present invention may also be modified in a 

25 way to form chimeric molecules comprising a breast cancer polypeptide fused to another, 
heterologous polypeptide or amino acid sequence. In one embodiment, such a chimeric 
molecule comprises a fusion of a breast cancer polypeptide with a tag polypeptide which 
provides an epitope to which an anti-tag antibody can selectively bind. The epitope tag is 
generally placed at the amino-or carboxyl-terminus of the breast cancer polypeptide. The 

30 presence of such epitope-tagged forms of a breast cancer polypeptide can be detected using 
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an antibody against the tag polypeptide. Also, provision of the epitope tag enables the breast 
cancer polypeptide to be readily purified by affinity purification using an anti-tag antibody or 
another type of affinity matrix that binds to the epitope tag. In an alternative embodiment, 
the chimeric molecule may comprise a fusion of a breast cancer polypeptide with an 

5 immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of the 
chimeric molecule, such a fusion could be to the Fc region of an IgG molecule. 

Various tag polypeptides and their respective antibodies are well known in the 
art. Examples include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; 
fflS6 and metal chelation tags, the flu HA tag polypeptide and its antibody 12CA5 (Field et 

10 al, Mol Cell Biol 8:2159-2165 (1988)); the o-myc tag and the 8F9, 3C7, 6E10, G4, B7 and 
9E10 antibodies thereto (Evan et al, Molecular and Cellular Biology 5:3610-3616 (1985)); 
and the Herpes Simplex virus glycoprotein D (gD) tag and its antibody (Paborsky et aL, 
Protein Engineering 3(6):547-553 (1990)). Other tag polypeptides include the Flag-peptide 
(Hopp et al, BioTechnology 6:1204-1210 (1988)); the KT3 epitope peptide (Martin et al, 

15 Science 255:192-194 (1992)); tubulin epitope peptide (Skinner et al., J. Biol Chem. 

266:15163-15166 (1991)); and the T7 gene 10 protein peptide tag (Lutz-Freyermuth et al, 
Proa Natl Acad. ScL USA 87:6393-6397 (1990)). 

Also included are other breast cancer proteins of the breast cancer family, and 
breast cancer proteins from other organisms, which are cloned and expressed as outlined 

20 below. Thus, probe or degenerate polymerase chain reaction (PCR) primer sequences may be 
used to find other related breast cancer proteins from humans or other organisms. As will be 
appreciated by those in the art, particularly useful probe and/or PGR primer sequences 
include the unique areas of die breast cancer nucleic acid sequence. As is generally known in 
the art, preferred PCR primers are from about 15 to about 35 nucleotides in length, with from 

25 about 20 to about 30 being preferred, and may contain inosine as needed. The conditions for 
the PCR reaction are well known in the art (e.g., Innis, PCR Protocols, supra). 

Antibodies to breast cancer proteins 

In a preferred embodiment, when die breast cancer protein is to be used to 
30 generate antibodies, e.g., for immunotherapy or immunodiagnosis, the breast cancer protein 
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should share at least one epitope or determinant with the full length protein. By "epitope" or 
"determinant" herein is typically meant a portion of a protein which will generate and/or bind 
an antibody or T-cell receptor in the context of MHC. Thus, in most instances, antibodies 
made to a smaller breast cancer protein will be able to bind to the full-length protein, 

5 particularly linear epitopes. In a preferred embodiment, the epitope is unique; that is, 
antibodies generated to a unique epitope show little or no cross-reactivity. 

Methods of preparing polyclonal antibodies are known to the skilled artisan 
(e.g., Coligan, supra\ and Harlow & Lane, supra). Polyclonal antibodies can be raised in a 
mammal, e.g., by one or more injections of an immunizing agent and, if desired, an adjuvant. 

10 Typically, the imrn nnigang agent and/or adjuvant will be injected in the mammal by multiple 
subcutaneous or intraperitoneal injections. The immunizing agent may include a protein 
encoded by a nucleic acid of the figures or fragment thereof or a fusion protein thereof. It 
may be useful to conjugate the immunizing agent to a protein known to be immunogenic in 
the mammal being immunized. Examples of such immunogenic proteins include but are not 

15 limited to keyhole limpet hemocyanin, serum albumin, bovine thyroglobulin, and soybean 
trypsin inhibitor. Examples of adjuvants which may be employed include Freund's complete 
adjuvant and MPL-TDM adjuvant (monophosphoiyl Lipid A, synthetic trehalose 
dicorynomycolate). The immunization protocol may be selected by one skilled in the art 
without undue experimentation. 

20 The antibodies may, alternatively, be monoclonal antibodies. Monoclonal 

antibodies may be prepared using hybridoma methods, such as those described by Kohler & 
Milstein, Nature 256:495 (1975). In a hybridoma method, a mouse, hamster, or other 
appropriate host animal, is typically immunized with an immunizing agent to elicit 
lymphocytes that produce or are capable of producing antibodies that will specifically bind to 

25 the immunizing agent. Alternatively, the lymphocytes may be immunized in vitro. The 

immunizing agent will typically include a polypeptide encoded by a nucleic acid of Tables 1- 
25 or fragment thereof, or a fusion protein thereof. Generally, either peripheral blood 
lymphocytes (*TBLs") are used if cells of human origin are desired, or spleen cells or lymph 
node cells are used if non-human mammalian sources are desired. The lymphocytes are then 

30 fused with an immortalized cell line using a suitable fusing agent, such as polyethylene 
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glycol, to form a hybridoma cell (Goding, Monoclonal Antibodies: Principles and Practice, 
pp. 59-103 (1986)). Immortalized cell lines are usually transformed mammalian cells, 
particularly myeloma cells of rodent, bovine and human origin. Usually, rat or mouse 
myeloma cell lines are employed. The hybridoma cells may be cultured in a suitable culture 
5 medium that preferably contains one or more substances that inhibit the growth or survival of 
the unfused, immortalized cells. For example, if the parental cells lack the enzyme 
hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the culture medium 
for the hybridomas typically will include hypoxanthine, aminopterin, and thymidine ("HAT 
medium"), which substances prevent the growth of HGPRT-deficient cells. 

10 In one embodiment, the antibodies are bispecific antibodies. Bispecific 

antibodies are monoclonal, preferably human or humanized, antibodies that have binding 
specificities for at least two different antigens or that have binding specificities for two 
epitopes on the same antigen. In one embodiment, one of the binding specificities is for a 
protein encoded by a nucleic acid Tables 1-25 or a fragment thereof; the other one is for any 

15 other antigen, and preferably for a cell-surface protein or receptor or receptor subunit, 
preferably one that is tumor specific. Alternatively, tetramer-type technology may create 
multivalent reagents. 

In a preferred embodiment, the antibodies to breast cancer protein are capable 
of reducing or eliminating a biological function of a breast cancer protein, as is described 

20 below. That is, the addition of anti-breast cancer protein ^ltibodies (either polyclonal or 

preferably monoclonal) to breast cancer tissue (or cells containing breast cancer) may reduce 
or eliminate the breast cancer. Generally, at least a 25% decrease in activity, growth, size or 
the like is preferred, with at least about 50% being particularly preferred and about a 95- 
100% decrease being especially preferred. 

25 In a preferred embodiment the antibodies to the breast cancer proteins are 

humanized antibodies (e,g., Xenerex Biosciences, Mederex, Inc., Abgenix, Inc., Protein 
Design Labs Jnc.) Humanized forms of non-human (e.g., murine) antibodies are chimeric 
molecules of irnmunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, 
Fab, Fab*, F(ab*)2 or other antigen-binding subsequences of antibodies) which contain 

30 minimal sequence derived from non-human immunoglobulin. Humanized antibodies include 
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human immimoglobulins (recipient antibody) in which residues from a complementary 
determining region (CDR) of the recipient are replaced by residues from a CDR of a non- 
human species (donor antibody) such as mouse, rat or rabbit having the desired specificity, 
affinity and capacity. In some instances, Fv framework residues of the human 

5 immunoglobulin are replaced by corresponding non-human residues. Humanized antibodies 
may also comprise residues which are found neither in the recipient antibody nor in the 
imported CDR or framework sequences. In general, a humanized antibody will comprise 
substantially all of at least one, and typically two, variable domains, in which all or 
substantially all of the CDR regions correspond to those of a non-human immunoglobulin 

1 0 and all or substantially all of the framework (FR) regions are those of a human 

immunoglobulin consensus sequence. The humanized antibody optimally also will comprise 
at least a portion of an immunoglobulin constant region (Fc), typically that of a human 
immunoglobulin (Jones et aL, Nature 321 :522-525 (1986); Riechmann et al, Nature 
332:323-329 (1988); and Tresta, Cuff . Op. Struct. Biol. 2^ Humanization 

15 can be essentially performed following the method of Winter and co-workers (Jones et al, 
Nature 321 :522-525 (1986); Riechmann et al., Nature 332:323-327 (1988); Verhoeyen et al., 
Science 239:1534-1536 (1988)), by substituting rodent CDRs or CDR sequences for the 
corresponding sequences of a human antibody. Accordingly, such humanized antibodies are 
chimeric antibodies (U.S. Patent No. 4,816,567), wherein substantially less than an intact 

20 human variable domain has been substituted by the corresponding sequence from a non- 
human species. 

Human antibodies can also be produced using various techniques known in die 
art, including phage display libraries (Hoogenboom & Winter, J. MoL Biol 227:381 (1991); 
Marks et al., J. MoL Biol 222:581 (1991)). The techniques of Cole et aL and Boerner et al. 

25 are also available for the preparation of human monoclonal antibodies (Cole et aL, 

Monoclonal Antibodies and Cancer Therapy, p. 77 (1985) and Boerner et al, J. Immunol 
147(l):86-95 (1991)). Similarly, human antibodies can be made by introducing of human 
immunoglobulin loci into transgenic animals, e.g., mice in which the endogenous 
immunoglobulin genes have been partially or completely inactivated. Upon challenge, 

30 human antibody production is observed, which closely resembles that seen in humans in all 
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respects, including gene rearrangement, assembly, and antibody repertoire. This approach is 
described, e.g., inU.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 5,633,425; 
5,661,016, and in the following scientific publications: Marks et al.> Bio/Technology 10:779- 
783 (1992); Lonberg et aL, Nature 368:856-859 (1994); Morrison, Nature 368:812-13 

5 (1994); Fishwild et al, Nature Biotechnology 14:845-51 (1996); Neuberger, Nature 

Biotechnology 14:826 (1996); Lonberg & Huszar, Jhfem. Rev. Immunol 13:65-93 (1995). 

By immunotherapy is meant treatment of breast cancer with an antibody raised 
against breast cancer proteins. As used herein, immunotherapy can be passive or active. 
Passive immunotherapy as defined herein is the passive transfer of antibody to a recipient 

10 (patient). Active immunization is the induction of antibody and/or T-cell responses in a 
recipient (patient). Induction of an immune response is the result of providing the recipient 
with an antigen to which antibodies are raised. As appreciated by one of ordinary skill in the 
art, the antigen may be provided by injecting a polypeptide against which antibodies are 
desired to be raised into a recipient, or contacting the recipient with a nucleic acid capable of 

1 5 expressing the antigen and under conditions for expression of the antigen, leading to an 
immune response. 

In a preferred embodiment the breast cancer proteins against which antibodies 
are raised are secreted proteins as described above. Without being bound by theory, 
antibodies used for treatment, bind and prevent the secreted protein from binding to its 

20 receptor, thereby inactivating the secreted breast cancer protein. 

In another preferred embodiment, the breast cancer protein to which 
antibodies are raised is a transmembrane protein. Without being bound by theory, antibodies 
used for treatment, bind the extracellular domain of the breast cancer protein and prevent it 
from binding to other proteins, such as circulating ligands or cell-associated molecules. The 

25 antibody may cause down-regulation of the transmembrane breast cancer protein. As will be 
appreciated by one of ordinary skill in the art, the antibody may be a competitive, non- 
competitive or uncompetitive inhibitor of protein binding to the extracellular domain of the 
breast cancer protein. The antibody is also an antagonist of the breast cancer protein. 
Further, the antibody prevents activation of the transmembrane breast cancer protein. In one 

30 aspect, when the antibody prevents the binding of other molecules to the breast cancer 
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protein, the antibody prevents growth of the cell. The antibody may also be used to target or 
sensitize the cell to cytotoxic agents, including, but not limited to TNF-a, TNF-P, IL-1, INF-y 
and IL-2, or chemotherapeutic agents including 5FU, vinblastine, actinomycin D, cisplatin, 
methotrexate, and the like. In some instances the antibody belongs to a sub-type that 

5 activates serum complement when complexed with the transmembrane protein thereby 
mediating cytotoxicity or antigen-dependent cytotoxicity (ADCQ. Thus, breast cancer is 
treated by administering to a patient antibodies directed against the transmembrane breast 
cancer protein. Antibody-labeling may activate a co-toxin, localize a toxin payload, or 
otherwise provide means to locally ablate cells. 

10 In another preferred embodiment, the antibody is conjugated to an effector 

moiety. The effector moiety can be any number of molecules, including labelling moieties 
such as radioactive labels or fluorescent labels, or can be a therapeutic moiety. In one aspect 
the therapeutic moiety is a small molecule that modulates the activity of the breast cancer 
protein. In another aspect the therapeutic moiety modulates the activity of molecules 

15 associated with or in close proximity to the breast cancer protein. The therapeutic moiety 
may inhibit enzymatic activity such as protease or collagenase or protein kinase activity 
associated with breast cancer. 

In a preferred embodiment, the therapeutic moiety can also be a cytotoxic 
agent In this method, targeting the cytotoxic agent to breast cancer tissue or cells, results in 

20 a reduction in the number of afflicted cells, thereby reducing symptoms associated with 

breast cancer. Cytotoxic agents are numerous and varied and include, but are not limited to, 
cytotoxic drugs or toxins or active fragments of such toxins. Suitable toxins and their 
corresponding fragments include diphtheria A chain, exotoxin A chain, ricin A chain, abrin A 
chain, curcin, crotin, phenomycin, enomycin and the like. Cytotoxic agents also include 

25 radiochemicals made by conjugating radioisotopes to antibodies raised against breast cancer 
proteins, or binding of a radionuclide to a chelating agent that has been covalently attached to 
the antibody. Targeting the therapeutic moiety to transmembrane breast cancer proteins not 
only serves to increase the local concentration of therapeutic moiety in the breast cancer 
afflicted area, but also serves to reduce deleterious side effects that may be associated with 

30 the therapeutic moiety. 
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In another pref eared embodiment, the breast cancer protein against which the 
antibodies are raised is an intracellular protein. In this case, the antibody may be conjugated 
to a protein which facilitates entry into the cell. In one case, the antibody enters the cell by 
endocytosis. In another embodiment, a nucleic acid encoding the antibody is administered to 
5 the individual or cell. Moreover, wherein the breast cancer protein can be targeted within a 
cell, i.e., the nucleus, an antibody thereto contains a signal for that target localization, i.e., a 
nuclear localization signal. 

The breast cancer antibodies of the invention specifically bind to breast cancer 
proteins. By "specifically bind" herein is meant that the antibodies bind to the protein with a 
10 Kd of at least about 0.1 mM, more usually at least about 1 nM, preferably at least about 0.1 
jiM or better, and most preferably, 0.01 pM or better. Selectivity of binding is also 
important 

Detection of breast cancer sequence for diagnostic and therapeutic applications 

15 In one aspect, the RNA expression levels of genes are determined for different 

cellular states in the breast cancer phenotype. Expression levels of genes in normal tissue 
(i.e., not undergoing breast cancer) and in breast cancer tissue (and in some cases, for varying 
severities of breast cancer that relate to prognosis, as outlined below) are evaluated to provide 
expression profiles. An expression profile of a particular cell state or point of development is 

20 essentially a "fingerprint" ofthe state. While two states may have any particular gene 
similarly expressed, the evaluation of a number of genes simultaneously allows the 
generation of a gene expression profile that is reflective ofthe state ofthe cell. By comparing 
expression profiles of cells in different states, information regarding which genes are 
important (including both up- and down-regulation of genes) in each of these states is 

25 obtained Then, diagnosis may be performed or confirmed to determine whether a tissue 
sample has the gene expression profile of normal or cancerous tissue. This will provide for 
molecular diagnosis of related conditions. 

"Differential expression," or grammatical equivalents as used herein, refers to 
qualitative or quantitative differences in fee temporal and/or cellular gene expression 

30 patterns within and among cells and tissue. Thus, a differentially expressed gene can 
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qualitatively have its expression altered, including an activation or inactivation, in, e.g., 
normal versus breast cancer tissue. Genes may be turned on or turned off in a particular state, 
relative to another state thus permitting comparison of two or more states. A qualitatively 
regulated gene will exhibit an expression pattern within a state or cell type which is 
5 detectable by standard techniques. Some genes will be expressed in one state or cell type, but 
not in both. Alternatively, the difference in expression may be quantitative, e.g., in that 
expression is increased or decreased; i.e., gene expression is either upregulated, resulting in 
an increased amount of transcript, or downregulated, resulting in a decreased amount of 
transcript The degree to which expression differs need only be large enough to quantify via 

10 standard characterization techniques as outlined below, such as by use of Aflymetrix 
GeneChip™ expression arrays, Lockhart, Nature Biotechnology 14:1675-1680 (1996), 
hereby expressly incorporated by reference. Other techniques include, but are not limited to, 
quantitative reverse transcriptase PCR, northern analysis and RNase protection. As outlined 
above, preferably the change in expression (i.e., upregulation or downregulation) is at least 

15 about 50%, more preferably at least about 100%, more preferably at least about 150%, more 
preferably at least about 200%, with from 300 to at least 1000% being especially preferred- 
Evaluation may be at the gene transcript, or the protein level. The amount of 
gene expression may be monitored using nucleic acid probes to the DNA or RNA equivalent 
of the gene transcript, and the quantification of gene expression levels, or, alternatively, the 

20 final gene product itself (protein) can be monitored, e.g.,^vith antibodies to the breast cancer 
protein and standard immunoassays (ELISAs, etc.) or other techniques, including mass 
spectroscopy assays, 2D gel electrophoresis assays, etc. Proteins corresponding to breast 
cancer genes, i.e., those identified as being important in a breast cancer phenotype, can be 
evaluated in a breast cancer diagnostic test 

25 hi a preferred embodiment, gene expression monitoring is performed 

simultaneously on a number of genes. Multiple protein expression monitoring can be 
performed as well. Similarly, these assays may be performed on an individual basis as well. 

In this embodiment, the breast cancer nucleic acid probes are attached to 
biochips as outlined herein for the detection and quantification of breast cancer sequences in 
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a particular cell. The assays are further described below in the example. PCR techniques can 
be used to provide greater sensitivity. 

In a preferred embodiment nucleic acids encoding the breast cancer protein are 
detected. Although DNA or KNA encoding the breast cancer protein may be detected, of 

5 particular interest are methods wherein an mRNA encoding a breast cancer protein is 
detected. Probes to detect mRNA can be a nucleotide/deoxynucleotide probe that is 
complementary to and hybridizes with the mRNA and includes, but is not limited to, 
oligonucleotides, cDNA or RNA. Probes also should contain a detectable label, as defined 
herein. In one method the mRNA is detected after immobilizing the nucleic acid to be 

10 examined on a solid support such as nylon membranes and hybridizing the probe with the 
sample. Following washing to remove the non-specifically bound probe, the label is 
detected In another method detection of the mRNA is performed in situ. In this method 
penneabilized cells or tissue samples are contacted with a detectably labeled nucleic acid 
probe for sufficient time to allow the probe to hybridize with the target mRNA. Following 

1 5 washing to remove the non-specifically bound probe, the label is detected. For example a 
digoxygenin labeled riboprobe (RNA probe) that is complementary to the mRNA encoding a 
breast cancer protein is detected by binding the digoxygenin with an anti-digoxygenin 
secondary antibody and developed with nitro blue tetrazolium and 5-bromo-4-chloro-3- 
indoyl phosphate. 

20 In a preferred embodiment, various proteins from the three classes of proteins 

as described herein (secreted, transmembrane or intracellular proteins) are used in diagnostic 
ass&ys. The breast cancer proteins, antibodies, nucleic acids, modified proteins and cells 
containing breast cancer sequences are used in diagnostic assays. This can be performed on 
an individual gene or corresponding polypeptide level. In a preferred embodiment, the 

25 expression profiles are used, preferably in conjunction with high throughput screening 
techniques to allow monitoring for expression profile genes and/or corresponding 
polypeptides. 

As described and defined herein, breast cancer proteins, including 
intracellular, transmembrane or secreted proteins, find use as markers of breast cancer. 
30 Detection of these proteins in putative breast cancer tissue allows for detection or diagnosis 
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of breast cancer. In one embodiment, antibodies are used to detect breast cancer proteins. A 
preferred method separates proteins from a sample by electrophoresis on a gel (typically a 
denaturing and reducing protein gel, but may be another type of gel, including isoelectric 
focusing gels and the like). Following separation of proteins, the breast cancer protein is 

5 detected, e.g., by immunoblotting with antibodies raised against the breast cancer protein. 
Methods of immunoblotting are well known to those of ordinary skill in the art 

In another preferred method, antibodies to the breast cancer protein find use in 
in situ imaging techniques, e.g., in histology (e.g., Methods in Cell Biology: Antibodies in 
Cell Biology, volume 37 (Asai, ed 1993)). In this method cells are contacted with from one 

1 0 to many antibodies to the breast cancer protein(s). Following washing to remove non- 
specific antibody binding, the presence of the antibody or antibodies is detected. In one 
embodiment the antibody is detected by incubating with a secondary antibody that contains a 
detectable label. In another method the primary antibody to the breast cancer protein(s) 
contains a detectable label, e.g. an enzyme marker that can act on a substrate. In another 

1 5 preferred embodiment each one of multiple primary antibodies contains a distinct and 

detectable label. This method finds particular use in simultaneous screening for a plurality of 
breast cancer proteins. As will be appreciated by one of ordinary skill in the art, many other 
histological imaging techniques are also provided by the invention. 

In a preferred embodiment the label is detected in a fluorometer which has the 

20 ability to detect and distinguish emissions of different wavelengths. In addition, a 
fluorescence activated cell sorter (FACS) can be used in the method 

In another preferred embodiment, antibodies find use in diagnosing breast 
cancer from blood, serum, plasma, stool, and other samples. Such samples, therefore, are 
useful as samples to be probed or tested for the presence of breast cancer proteins. 

25 Antibodies can be used to detect a breast cancer protein by previously described 
immunoassay techniques including BUS A, immunoblotting (western blotting), 
immunoprecipitation, BIACORE technology and the like. Conversely, the presence of 
antibodies may indicate an immune response against an endogenous breast cancer protein. 

In a preferred embodiment, in situ hybridization of labeled breast cancer 

30 nucleic acid probes to tissue arrays is done. For example, arrays of tissue samples, including 
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breast cancer tissue and/or normal tissue, are made. In situ hybridization (see, e.g., Ausubel, 
supra) is then performed. When comparing the fingerprints between an individual and a 
standard, the skilled artisan can make a diagnosis, a prognosis, or a prediction based on the 
findings. It is further understood that the genes which indicate the diagnosis may differ from 
5 those which indicate the prognosis and molecular profiling of the condition of the cells may 
lead to distinctions between responsive or refractory conditions or may be predictive of 
outcomes. 

In a preferred embodiment, the breast cancer proteins, antibodies, nucleic 
acids, modified proteins and cells containing breast cancer sequences are used in prognosis 

1 0 assays. As above, gene expression profiles can be generated that correlate to breast cancer, 
in terms of long term prognosis. Again, this may be done on either a protein or gene level, 
with the use of genes being preferred. As above, breast cancer probes may be attached to 
biochips for the detection and quantification of breast cancer sequences in a tissue or patient 
The assays proceed as outlined above for diagnosis. PCR method may provide more 

15 sensitive and accurate quantification. 



Assays for therapeutic compounds 

hi a preferred embodiment members of the proteins, nucleic acids, and 
antibodies as described herein are used in drug screening assays. The breast cancer proteins, 
20 antibodies, nucleic acids, modified proteins and cells containing breast cancer sequences are 
used in drug screening assays or by evaluating the effect of drug candidates on a "gene 
expression profile" or expression profile of polypeptides. In a preferred embodiment, the 
expression profiles arc used, preferably in conjunction with high throughput screening 
techniques to allow monitoring for expression profile genes after treatment with a candidate 
25 agent (e.g., Zlokarnik, et aL 9 Science 279:84-8 (1998); Heid, Genome Res 6:986-94, 1996). 

In a preferred embodiment, the breast cancer proteins, antibodies, nucleic 
acids, modified proteins and cells containing the native or modified breast cancer proteins are 
used in screening assays. That is, the present invention provides novel methods for screening 
for compositions which modulate the breast cancer phenotype or an identified physiological 
30 function of a breast cancer protein. As above, this can be done on an individual gene level or 
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by evaluating the effect of drug candidates on a "gene expression profile". In a preferred 
embodiment, the expression profiles are used, preferably in conjunction with high throughput 
screening techniques to allow monitoring for expression profile genes after treatment with a 
candidate agent, see Zlokarnik, supra. 
5 Having identified the differentially expressed genes herein, a variety of assays 

may be executed. In a preferred embodiment, assays may be run on an individual gene or 
protein level. That is, having identified a particular gene as up regulated in breast cancer, test 
compounds can be screened for the ability to modulate gene expression or for binding to the 
breast cancer protein. 'Modulation" thus includes both an increase and a decrease in gene 

1 0 expression. The preferred amount of modulation will depend on the original change of the 
gene expression in normal versus tissue undergoing breast cancer, with changes of at least 
10%, preferably 50%, more preferably 100-300%, and in some embodiments 300-1000% or 
greater. Thus, if a gene exhibits a 4-fold increase in breast cancer tissue compared to normal 
tissue, a decrease of about four-fold is often desired; similarly, a 10-fold decrease in breast 

15 cancer tissue compared to normal tissue often provides a target value of a 10-fold increase in 
expression to be induced by the test compound. 

The amount of gene expression may be monitored using nucleic acid probes 
and the quantification of gene expression levels, or, alternatively, the gene product itself can 
be monitored, e.g., through the use of antibodies to the breast cancer protein and standard 

20 immunoassays. Proteomics and separation techniques may also allow quantification of 
expression. 

In a preferred embodiment, gene expression or protein monitoring of a number 
of entities, i.e., an expression profile, is monitored simultaneously. Such profiles will 
typically involve a plurality of those entities described herein.. 
25 In this embodiment, the breast cancer nucleic acid probes are attached to 

biochips as outlined herein for the detection and quantification of breast cancer sequences in 
a particular cell. Alternatively, PCR may be used Thus, a series, e.g., of microtiter plate, 
may be used with dispensed primers in desired wells. A PCR reaction can then be performed 
and analyzed for each well. 
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Expression monitoring can be performed to identify compounds that modify 
the expression of one or more breast cancer-associated sequences, e.g., a polynucleotide 
sequence set out inTable 17. Generally, in a preferred embodiment, a test modulator is added 
to the cells prior to analysis. Moreover, screens are also provided to identify agents that 
5 modulate breast cancer, modulate breast cancer proteins, bind to a breast cancer protein, or 
interfere with the binding of a breast cancer protein and an antibody or other binding partner. 

The term "test compound" or "drug candidate" or ''modulator" or grammatical 
equivalents as used herein describes any molecule, e.g., protein, oligopeptide, small organic 
molecule, polysaccharide, polynucleotide, etc., to be tested for the capacity to directly or 

10 indirectly alter the breast cancer phenotype or the expression of a breast cancer sequence, 
e.g., a nucleic acid or protein sequence. In preferred embodiments, modulators alto: 
expression profiles, or expression profile nucleic acids or proteins provided herein. In one 
embodiment, the modulator suppresses a breast cancer phenotype, e.g. to a normal tissue 
fingerprint In another embodiment, a modulator induced a breast cancer phenotype. 

1 5 Generally, a plurality of assay mixtures are run in parallel with different agent concentrations 
to obtain a differential response to the various concentrations. Typically, one of these 
concentrations serves as a negative control, i.e., at zero concentration or below the level of 
detection. , 

Drug candidates encompass numerous chemical classes, though typically they 

20 are organic molecules, preferably small organic compounds having a molecular weight of 
more than 100 and less than about 2,500 daltons. Preferred small molecules are less than 
2000, or less than 1500 or less than 1000 or less than 500 D. Candidate agents comprise 
functional groups necessary for structural interaction with proteins, particularly hydrogen 
bonding, and typically include at least an amine, carbonyl, hydroxyl or carboxyl group, 

25 preferably at least two of the functional chemical groups. The candidate agents often 
comprise cyclical carbon or heterocyclic structures and/or aromatic or polyaromatic 
structures substituted with one or more of the above functional groups. Candidate agents are 
also found among biomolecules including peptides, saccharides, fatty acids, steroids, purines, 
pyrrolidines, derivatives, structural analogs or combinations thereof Particularly preferred 

30 are peptides. 
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In one aspect, a modulator will neutralize the effect of a breast cancer protein. 
By Neutralize" is meant that activity of a protein is inhibited or blocked and the consequent 
effect on the cell* 

In certain embodiments, combinatorial libraries of potential modulators will be 
5 screened for an ability to bind to a breast cancer polypeptide or to modulate activity. 

Conventionally, new chemical entities with useful properties are generated by identifying a 
chemical compound (called a "lead compound") with some desirable property or activity, 
e.g., inhibiting activity, creating variants of the lead compound, and evaluating the property 
and activity of those variant compounds. Often, high throughput screening (HTS) methods 

10 are employed for such an analysis. 

In one preferred embodiment, high throughput screening methods involve 
providing a library containing a large number of potential therapeutic compounds (candidate 
compounds). Such "combinatorial chemical libraries" are then screened in one or more 
assays to identify those library members (particular chemical species or subclasses) that 

IS display a desired characteristic activity. The compounds thus identified can serve as 

conventional "lead compounds" or can themselves be used as potential or actual therapeutics. 

A combinatorial chemical library is a collection of diverse chemical 
compounds generated by either chemical synthesis or biological synthesis by combining a 
number of chemical ''building blocks" such as reagents. For example, a linear combinatorial 

20 chemical library, such as a polypeptide (e.g., mutein) library, is formed by combining a set of 
chemical building blocks called amino acids in every possible way for a given compound 
length (i.e., the number of amino acids in a polypeptide compound). Millions of chemical 
compounds can be synthesized through such combinatorial mixing of chemical building 
blocks (Gallop et aL, J. Med Chem. 37(9): 1233-1251 (1994)). 

25 Preparation and screening of combinatorial chemical libraries is well known to 

those of skill in the art Such combinatorial chemical libraries include, but are not limited to, 
peptide libraries (see, eg., ILS. Patent No. 5,010,175, Furka, Pept Prot Res. 37:487-493 
(1991), Houghton et al y Nature, 354:84-88 (1991)), peptoids (PCT Publication No WO 
91/19735), encoded peptides (PCT Publication WO 93/20242), random bio-oligomers (PCT 

30 Publication WO 92/00091), benzodiazepines (U.S. Pat. No. 5,288,514), diversomers such as 
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hydantoins, benzodiazepines and dipeptides (Hobbs etal, Proa Nat Acad. ScL USA 
90:6909-6913 (1993)), vinylogous polypeptides (Hagihara et al, J. Amer. Chem. Soc. 
1 14:6568 (1992)), nonpeptidal peptidomimetics with a Beta-D-Glucose scaffolding 
(Hirschmann et al, J* Amer. Chem. Soc. 1 14:9217-9218 (1992)), analogous organic syntheses 

5 of small compound libraries (Chen et al, J. Amer. Chem. Soc. 1 16:2661 (1994)), 

oligocarbamates (Cho, et al., Science 261:1303 (1993)), and/or peptidyl phosphonates 
(Campbell et al, J. Org. Chem. 59:658 (1994)). See, generally, Gordon et al, 1 Med. Chem. 
37:1385 (1994), nucleic acid libraries {see, e.g, Strategene, Corp.), peptide nucleic acid 
libraries (see, eg, U.S. Patent 5,539,083), antibody libraries (see, eg, Vaughn et al, Nature 

10 Biotechnology 14(3):309-314 (1996), and PCT/US96/1Q287), carbohydrate libraries (see, 
eg, Liang et al, Science 274:1520-1522 (1996), and U.S. Patent No. 5,593,853), and small 
organic molecule libraries (see, eg, benzodiazepines, Baum, C&EN, Jan 18, page 33 (1993); 
isoprenoids, U.S. Patent No. 5,569,588; thiazolidinones and metathiazanones, U.S. Patent No. , 
5,549,974; pyrrolidines, U.S. Patent Nos. 5,525,735 and 5,519,134; morpholino compounds, 

15 U.S. Patent No. 5,506,337; benzodiazepines, U.S. Patent No. 5,288,514; and the like). 

Devices for the preparation of combinatorial libraries are commercially 
available (see, eg, 357 MPS, 390 MPS, Advanced Chem Tech, Louisville KY, Symphony, 
Rainin, Woburn, MA, 433A Applied Biosystems, Foster City, CA, 9050 Plus, Millipore, 
Bedford, MA). 

20 -A number of well known robotic systems have also been developed for 

solution phase chemistries. These systems include automated workstations like the 
automated synthesis apparatus developed by Takeda Chemical Industries, LTD. (Osaka, 
Jap an) and many robotic systems utilizing robotic arms (Zymate II, Zymark Corporation, 
Hopkinton, Mass.; Qrca, Hewlett-Packard, Palo Alto, Calif.), which mimic the manual 

25 synthetic operations performed by a chemist Any of the above devices are suitable for use 
with the present invention. The nature and implementation of modifications to these devices 
(if any) so that they can operate as discussed herein will be apparent to persons skilled in the 
relevant art In addition, numerous combinatorial libraries are themselves commercially 
available (see, eg, ComGenex, Princeton, N.J., Asinex, Moscow, Ru, Tripos, Inc., St Louis, 
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MO, ChemStar, Ltd, Moscow, RU, 3D Pharmaceuticals, Exton, PA, Martek Biosciences, 
Columbia, MD, etc). 

The assays to identify modulators are amenable to high throughput screening. 
Preferred assays thus detect enhancement or inhibition of breast cancer gene transcription, 
5 inhibition or enhancement of polypeptide expression, and inhibition or enhancement of 
polypeptide activity. 

High throughput assays for the presence, absence, quantification, or other 
properties of particular nucleic acids or protein products are well known to those of skill in 
the art Similarly, binding assays and reporter gene assays are similarly well known. Thus, 

10 e.g., U.S. Patent No. 5,559,410 discloses high throughput screening methods for proteins, 
U.S. Patent No. 5,585,639 discloses high througl^ut screening methods for nucleic acid 
binding (i.e., in arrays), while U.S. Patent Nos. 5,576,220 and 5,541,061 disclose high 
throughput methods of screening for ligand/antibody binding. 

In addition, high throughput screening systems are commercially available 

15 {see, e.g., Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; Beckman 
Instruments, Inc. Fullerton, CA; Precision Systems, Inc., Natick, MA, eta). These systems 
typically automate entire procedures, including all sample and reagent pipetting, liquid 
dispensing, timed incubations, and final readings of the microplate in detector(s) appropriate 
for the assay. These configurable systems provide high throughput and rapid start up as well 

20 as a high degree of flexibility and customization. The manufacturers of such systems provide 
detailed protocols for various high throughput systems. Thus, e.g., Zymark Corp. provides 
technical bulletins describing screening systems for detecting the modulation of gene 
transcription, ligand binding, and the like. 

In one embodiment, modulators are proteins, often naturally occurring 

25 proteins or fragments of naturally occurring proteins. Thus, e.g., cellular extracts containing 
proteins, or random or directed digests of proteinaceous cellular extracts, may be used In 
this way libraries of proteins may be made for screening in the methods of the invention. 
Particularly preferred in this embodiment are libraries of bacterial, fungal, viral, and 
mammalian proteins, with the latter being preferred, and human proteins being especially 
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preferred Particularly useful test compound will be directed to the class of proteins to which 
the target belongs, e.g. y substrates for enzymes or ligands and receptors. 

In a preferred embodiment, modulators are peptides of from about 5 to about 
30 amino acids, with from about 5 to about 20 amino acids being preferred, and from about 7 
5 to about 1 5 being particularly preferred The peptides may be digests of naturally occurring 
proteins as is outlined above, random peptides, or "biased" random peptides. By 
"randomized" or grammatical equivalents herein is meant that each nucleic acid and peptide 
consists of essentially random nucleotides and amino acids, respectively. Since generally 
these random peptides (or nucleic acids, discussed below) are chemically synthesized, they 

1 0 may incorporate any nucleotide or amino acid at any position. The synthetic process can be 
designed to generate randomized proteins or nucleic acids, to allow the formation of all or 
most of the possible combinations over the length of the sequence, thus forming a library of 
randomized candidate bioactive proteinaceous agents. 

hi one embodiment, the library is fully randomized, with no sequence 

1 5 preferences or constants at any position. In a preferred embodiment, the library is biased. 
That is, some positions within the sequence are either held constant, or are selected from a 
limited number of possibilities. For example, in a preferred embodiment, the nucleotides or 
amino acid residues are randomized within a defined class, e.g., of hydrophobic amino acids, 
hydrophilic residues, sterically biased (either small or large) residues, towards the creation of 

20 nucleic acid binding domains, the creation of cysteines, for cross-linking, prolines for SH-3 
domains, serines, threonines, tyrosines or histidines for phosphorylation sites, etc., or to 
purines, etc. ' 

Modulators of breast cancer can also be nucleic acids, as defined above. 

As described above generally for proteins, nucleic acid modulating agents may 
25 be naturally occurring nucleic acids, random nucleic acids, or "biased" random nucleic acids. 
For example, digests of procaryotic or eucaryotic genomes may be used as is outlined above 
for proteins. 

In a preferred embodiment, the candidate compounds are organic chemical 
moieties, a wide variety of which are available in the literature. 
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After the candidate agent has been added and the cells allowed to incubate for 
some period of time, the sample containing a target sequence to be analyzed is added to the 
biochip. If required, the target sequence is prepared using known techniques. For example, 
tiie sample may be treated to lyse the cells, using known lysis buffers, electroporation, etc., 
5 with purification and/or amplification such as PGR performed as appropriate. For example, 
an in vitro transcription with labels covalently attached to the nucleotides is performed. 
Generally, the nucleic acids are labeled with biotin-FITC or PE, or with cy3 or cy5. 

In a preferred embodiment, the target sequence is labeled with, e.g., a 
fluorescent, a chemiluminegcent, a chemical, or a radioactive signal, to provide a means of 

10 detecting the target sequence's specific binding to a probe. The label also can be an enzyme, 
such as, alkaline phosphatase or horseradish peroxidase, which when provided with an 
appropriate substrate produces a product that can be detected. Alternatively, the label can be 
a labeled compound or small molecule, such as an enzyme inhibitor, that binds but is not 
catalyzed or altered by the enzyme. The label also can be a moiety or compound, such as, an 

15 epitope tag or biotin which specifically binds to streptavidin. For the example of biotin, the 
streptavidin is labeled as described above, thereby, providing a detectable signal for the 
bound target sequence. Unbound labeled streptavidin is typically removed prior to analysis. 

As will be appreciated by those in the art, these assays can be direct 
hybridization assays or can comprise "sandwich assays", which include the use of multiple 

20 probes, as is generally outlined in U.S. Patent Nos. 5,681,^02, 5,597,909, 5,545,730, 

5,594,117, 5,591,584, 5,571,670, 5,580,731, 5,571,670, 5,591,584, 5,624,802, 5,635,352, 
5,594,118, 5,359,100, 5,124,246 and 5,681,697, all of which are hereby incorporated by 
reference. In this embodiment, in general, the target nucleic acid is prepared as outlined 
above, and then added to the biochip comprising a plurality of nucleic acid probes, under 

25 conditions that allow the formation of a hybridization complex. 

A variety of hybridization conditions may be used in the present invention, 
including high, moderate and low stringency conditions as outlined above. The assays are 
generally run under stringency conditions which allows formation of the label probe 
hybridization complex only in the presence of target. Stringency can be controlled by 

30 altering a step parameter that is a thermodynamic variable, including, but not limited to, 
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temperature, formamide concentration, salt concentration, chaotropic salt concentration pH, 
organic solvent concentration, etc. 

These parameters may also be used to control non-specific binding, as is 
generally outlined in U.S. Patent No. 5,681,697. Thus it may be desirable to perform certain 
5 steps at higher stringency conditions to reduce non-specific binding. 

The reactions outlined herein may be accomplished in a variety of ways. 
Components of the reaction may be added simultaneously, or sequentially, in different orders, 
with preferred embodiments outlined below. In addition, the reaction may include a variety 
of other reagents. These include salts, buffers, neutral proteins, e.g. albumin, detergents, etc. 
10 which may be used to facilitate optimal hybridization and detection, and/or reduce non- 
specific or background interactions. Reagents that otherwise improve the efficiency of the 
assay, such as protease inhibitors, nuclease inhibitors, anti-microbial agents, etc. y may also be 
used as appropriate, depending on the sample preparation methods and purity of the target 

The assay data are analyzed to determine the expression levels, and changes in 
15 expression levels as between states, of individual genes, forming a gene expression profile. 

Screens are performed to identify modulators of the breast cancer phenotype. 
hi one embodiment, screening is performed to identify modulators that can induce or 
suppress a particular expression profile, thus preferably generating the associated phenotype. 
In another embodiment, e*g. 9 for diagnostic applications, having identified differentially 
20 expressed genes important in a particular state, screens can be performed to identify 

modulators that alter expression ofindividual genes. In an another embodiment, screening is 
performed to identify modulators that alter a biological function of the expression product of 
a differentially expressed gene. Again, having identified the importance of a gene in a 
particular state, screens are performed to identify agents that bind and/or modulate the 
25 biological activity of the gene product. 

In addition screens can be done for genes that are induced in response to a 
candidate agent After identifying a modulator based upon its ability to suppress a breast 
cancer expression pattern leading to a normal expression pattern, or to modulate a single 
breast cancer gene expression profile so as to mimic the expression of the gene from normal 
30 tissue, a screen as described above can be performed to identify genes that are specifically 
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modulated in response to the agent Comparing expression profiles between normal tissue 
and agent treated breast cancer tissue reveals genes that are not expressed in normal tissue or 
breast cancer tissue, but axe expressed in agent treated tissue. These agent-specific sequences 
can be identified and used by methods described herein for breast cancer genes or proteins. 
5 In particular these sequences and the proteins they encode find use in marking or identifying 
agent treated cells. In addition, antibodies can be raised against the agent induced proteins 
and used to target novel therapeutics to the treated breast cancer tissue sample. 

Thus, in one embodiment, a test compound is administered to a population of 
breast cancer cells, that have an associated breast cancer expression profile. By 

10 "administration" or "contacting" herein is meant that the candidate agent is added to the cells 
in such a manner as to allow the agent to act upon the cell, whether by uptake and 
intracellular action, or by action at the cell surface. In some embodiments, nucleic acid 
encoding a proteinaceous candidate agent (i.e., a peptide) may be put into a viral construct 
such as an adenoviral or retroviral construct, and added to the cell, such that expression of 

15 the peptide agent is accomplished, e.g., PCT US97/01019. Regulatable gene therapy systems 
can also be used 

Once the test compound has been administered to the cells, the cells can be 
washed if desired and are allowed to incubate under preferably physiological conditions for 
some period of time. The cells are then harvested and a new gene expression profile is 
20 generated, as outlined herein. 

Thus, e.g., breast cancer tissue may be screened for agents that modulate, e.g., 
induce or suppress the breast cancer phenotype. A change in at least one gene, preferably 
many, of the expression profile indicates that the agent has an effect on breast cancer activity. 
By defining such a signature for the breast cancer phenotype, screens for new drugs that alter 
25 the phenotype can be devised. With this approach, the drug target need not be known and 
need not be represented in the original expression screening platform, nor does the level of 
transcript for the target protein need to change. 

In a preferred embodiment, as outlined above, screens may be done on 
individual genes and gene products (proteins). That is, having identified a particular 
30 differentially expressed gene as important in a particular state, screening of modulators of 
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either the expression of the gene or the gene product itself can be done. The gene products of 
differentially expressed genes are sometimes referred to herein as "breast cancer proteins" or 
a "breast cancer modulatory protein". The breast cancer modulatory protein may be a 
fragment, or alternatively, be the foil length protein to the fragment encoded by the nucleic 
5 acids of the Tables. Preferably, the breast cancer modulatory protein is a fragment In a 
preferred embodiment, the breast cancer amino acid sequence which is used to determine 
sequence identity or similarity is encoded by a nucleic acid of Table 25. In another 
embodiment, the sequences are naturally occurring allelic variants of a protein encoded by a 
nucleic acid of Table 25. In another embodiment, the sequences are sequence variants as 

1 0 further described herein. 

Preferably, the breast cancer modulatory protein is a fragment of 
approximately 14 to 24 amino acids long. More preferably the fragment is a soluble 
fragment. Preferably, the fragment includes a non-transmembrane region. In a preferred 
embodiment, the fragment has an N-terminal Cys to aid in solubility. In one embodiment, the 

1 5 C-terminus of the fragment is kept as a free acid and the N-terminus is a free amine to aid in 
coupling, i.e., to cysteine. 

In one embodiment the breast cancer proteins are conjugated to an 
immunogenic agent as discussed herein. In one embodiment the breast cancer protein is 
conjugated to BSA. 

20 Measurements of breast cancer polypeptide activity, or of breast cancer or die 

breast cancer phenotype can be performed using a variety of assays. For example, the effects 
of the test compounds upon the function of the breast cancer polypeptides can be measured 
by examining parameters described above. A suitable physiological change that affects 
activity can be used to assess the influence of a test compound on the polypeptides of this 

25 invention. When the functional consequences are determined using intact cells or animals, 
one can also measure a variety of effects such as, in the case of breast cancer associated with 
tumors, tumor growth, tumor metastasis, neovascularization, hormone release, transcriptional 
changes to both known and uncharacterized genetic markers (e.g., northern blots), changes in 
cell metabolism such as cell growth or pH changes, and changes in intracellular second 
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messengers such as cGMP. In the assays of the invention, mammalian breast cancer 
polypeptide is typically used, e.g., mouse, preferably human. 

Assays to identify compounds with modulating activity can be performed in 
vitro. For example, a breast cancer polypeptide is first contacted with a potential modulator 
5 and incubated for a suitable amount of time, e.g., from 0.5 to 48 hours. In one embodiment, 
the breast cancer polypeptide levels are determined in vitro by measuring the level of protein 
or mRNA. The level of protein is measured using immunoassays such as western blotting, 
FT IS A and the like with an antibody that selectively binds to the breast cancer polypeptide or 
a fragment thereof. For measurement of mRNA, amplification, e.g., using PCR, LCR, or 

10 hybridization assays, e.g., northern hybridization, RNAse protection, dot blotting, are 
preferred. The level of protein or mRNA is detected using directly or indirectly labeled 
detection agents, e.g., fluorescently or radioactively labeled nucleic acids, radioactively or 
enzymatically labeled antibodies, and the like, as described herein. 

Alternatively, a reporter gene system can be devised using the breast cancer 

1 5 protein promoter operably linked to a reporter gene such as luciferase, green fluorescent 
protein, CAT, or p-gal. The reporter construct is typically transfected into a cell. After 
treatment with a potential modulator, the amount of reporter gene transcription, translation, or 
activity is measured according to standard techniques known to those of skill in the art 
In a preferred embodiment, as outlined above, screens may be done on 

20 individual genes and gene products (proteins). That is, having identified a particular 

differentially expressed gene as important in a particular state, screening of modulators of the 
expression of the gene or the gene product itself can be done. The gene products of 
differentially expressed genes are sometimes referred to herein as "breast cancer proteins." 
The breast cancer protein may be a fragment, or alternatively, be the full length protein to a 

25 fragment shown herein. 

In one embodiment, screening for modulators of expression of specific genes 
is performed. Typically, the expression of only one or a few genes are evaluated In another 
embodiment, screens are designed to first find compounds that bind to differentially 
expressed proteins. These compounds are then evaluated for the ability to modulate 
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differentially expressed activity. Moreover, once initial candidate compounds are identified, 
variants can be further screened to better evaluate structure activity relationships. 

In a preferred embodiment, binding assays are done. In general, purified or 
isolated gene product is used; that is, the gene products of one or more differentially 
5 expressed nucleic acids are made. For example, antibodies are generated to the protein gene 
products, and standard immunoassays are run to determine the amount of protein present 
Alternatively, cells comprising the breast cancer proteins can be used in the assays. 

Thus, in a preferred embodiment, the methods comprise combining a breast 
cancer protein and a candidate compound, and determining the binding of the compound to 

10 the breast cancer protein. Preferred embodiments utilize the human breast cancer protein, 
although other mammalian proteins may also be used, e.g. for the development of animal 
models of human disease. In some embodiments, as outlined herein, variant or derivative 
breast cancer proteins may be used. 

Generally, in a preferred embodiment of the methods herein, the breast cancer 

1 5 protein or the candidate agent is non-diflusably bound to an insoluble support having isolated 
sample receiving areas (e.g. a microliter plate, an array, etc.). The insoluble supports may be 
made of any composition to which the compositions can be bound, is readily separated from 
soluble material, and is otherwise compatible with the overall method of screening. The 
surface of such supports may be solid or porous and of any convenient shape. Examples of 

20 suitable insoluble supports include microtiter plates, arrack, membranes and beads. These are 
typically made of glass, plastic (e.g., polystyrene), polysaccharides, nylon or nitrocellulose, 
teflon™, etc. Microtiter plates and arrays are especially convenient because a large number 
of assays can be carried out simultaneously, using small amounts of reagents and samples. 
The particular manner of binding of the composition is not crucial so long as it is compatible 

25 with the reagents and overall methods of the invention, maintains the activity of the 

composition and is nondifiusable. Preferred methods of binding include the use of antibodies 
(which do not sterically block either the ligand binding site or activation sequence when the 
protein is bound to the support), direct binding to "stick/* or ionic supports, chemical 
crosslinking, the synthesis of the protein or agent on the surface, etc. Following binding of 

30 the protein or agent, excess unbound material is removed by washing. The sample receiving 
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areas may then be blocked through incubation with bovine serum albumin (BS A), casein or 
other innocuous protein or other moiety. 

In a preferred embodiment, the breast cancer protein is bound to the support, 
and a test compound is added to the assay. Alternatively, the candidate agent is bound to the 
5 support and the breast cancer protein is added. Novel binding agents include specific 
antibodies, non-natural binding agents identified in screens of chemical libraries, peptide 
analogs, etc. Of particular interest are screening assays for agents that have a low toxicity for 
human cells. A wide variety of assays may be used for this purpose, including labeled in 
vitro protein-protein binding assays, electrophoretic mobility shift assays, immunoassays for 

10 protein binding, functional assays (phosphorylation assays, etc.) and the like. 

The determination of the binding of the test modulating compound to the 
breast cancer protein may be done in a number of ways. In a preferred embodiment, die 
compound is labeled, and binding determined directly, e.g., by attaching all or a portion of 
the breast cancer protein to a solid support, adding a labeled candidate agent (e.g., a 

1 5 fluorescent label), washing off excess reagent, and determining whether the label is present 
on the solid support Various blocking and washing steps may be utilized as appropriate. 

hi some embodiments, only one of the components is labeled, e.g., the 
proteins (or proteinaceous candidate compounds) can be labeled. Alternatively, more than 
one component can be labeled with different labels, e.gl, l25 l for the proteins and a fluorophor 

20 for the compound. Proximity reagents, e.g., quenching or energy transfer reagents are also 
useful. 

In one embodiment, die binding of the test compound is determined by 
competitive binding assay. The competitor is a binding moiety known to bind to the target 
molecule (i.e., a breast cancer protein), such as an antibody, peptide, binding partner, ligand, 

25 etc. Under certain circumstances, there may be competitive binding between the compound 
and the binding moiety, with the binding moiety displacing the compound. In one 
embodiment, the test compound is labeled. Either the compound, or the competitor, or both, 
is added first to the protein for a time sufficient to allow binding, if present Incubations may 
be performed at a temperature which facilitates optimal activity, typically between 4 and 

30 40°C. Incubation periods are typically optimized, e.g., to facilitate vapid high throughput 
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screening. Typically between 0.1 and 1 hour will be sufficient Excess reagent is generally 
removed or washed away. The second component is then added, and the presence or absence 
of the labeled component is followed, to indicate binding. 

In a preferred embodiment, the competitor is added first, followed by the test 
5 compound. Displacement of the competitor is an indication that the test compound is binding 
to the breast cancer protein and thus is capable of binding to, and potentially modulating, the 
activity of the breast cancer protein. In this embodiment, either component can be labeled 
Thus, e.g., if the competitor is labeled, the presence of label in the wash solution indicates 
displacement by the agent Alternatively, if the test compound is labeled, the presence of the 

10 label on the support indicates displacement 

In an alternative embodiment, the test compound is added first, with 
incubation and washing, followed by the competitor. The absence of binding by the 
competitor may indicate that the test compound is bound to the breast cancer protein with a 
higher affinity. Thus, if the test compound is labeled, the presence of the label on the 

1 5 support, coupled with a lack of competitor binding, may indicate that the test compound is 
capable of binding to the breast cancer protein. 

In a preferred embodiment, the methods comprise differential screening to 
identity agents that are capable of modulating the activity of the breast cancer proteins. In 
this embodiment the methods comprise combining a breast cancer protein and a competitor 

20 in a first sample. A second sample comprises a test compound, a breast cancer protein, and a 
competitor. The binding of the competitor is determined for both samples, and a change, or 
difference in binding between the two samples indicates the presence of an agent capable of 
binding to the breast cancer protein and potentially modulating its activity. That is, if the 
binding of the competitor is different in the second sample relative to the first sample, the 

25 agent is capable of binding to the breast cancer protein. 

Alternatively, differential screening is used to identify drug candidates that 
bind to the native breast cancer protein, but cannot bind to modified breast cancer proteins. 
The structure of the breast cancer protein may be modeled, and used in rational drug design 
to synthesize agents that interact with that site. Drug candidates that affect the activity of a 
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breast cancer protein are also identified by screening drugs for the ability to either enhance or 
reduce the activity of the protein. 

Positive controls and negative controls may be used in the assays. Preferably 
control and test samples are performed in at least triplicate to obtain statistically significant 
5 results. Incubation of all samples is for a time sufficient for the binding of the agent to the 
protein. Following incubation, samples are washed free of non-specifically bound material 
and the amount of bound, generally labeled agent determined. For example, where a 
radiolabel is employed, the samples may be counted u\ a scintillation counter to determine the 
amount of bound compound. 
10 A variety of other reagents may be included in the screening assays. These 

include reagents like salts, neutral proteins, e.g. albumin, detergents, etc. which may be used 
to facilitate optimal protein-protein binding and/or reduce non-specific or background 
interactions. Also reagents that otherwise improve the efficiency of the assay, such as 
protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may be used. The mixture 
IS of components may be added in an order that provides for the requisite binding. 

In a preferred embodiment, the invention provides methods for screening for a 
compound capable of modulating the activity of a breast cancer protein. The methods 
comprise adding a test compound, as defined above, to a cell comprising breast cancer 
proteins. Preferred cell types include almost any cell. The cells contain a recombinant 
20 nucleic acid that encodes a breast cancer protein. In a preferred embodiment, a library of 
candidate agents are tested on a plurality of cells. 

In one aspect, the assays are evaluated in the presence or absence or previous 
or subsequent exposure of physiological signals, e.g. hormones, antibodies, peptides, 
antigens, cytokines, growth factors, action potentials, pharmacological agents including 
25 chemotherapeutics, radiation, carcinogenics, or other cells (i.e. cell-cell contacts). In another 
example, the determinations are determined at different stages of the cell cycle process. 

In this way, compounds that modulate breast cancer agents are identified. 
Compounds with pharmacological activity are able to enhance or interfere with the activity of 
the breast cancer protein. Once identified, similar structures are evaluated to identify critical 
30 structural feature of the compound. 
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In one embodiment, a method of inhibiting breast cancer cell division is 
provided The method comprises administration of a breast cancer inhibitor. In another 
embodiment, a method of inhibiting breast cancer is provided The method comprises 
administration of a breast cancer inhibitor. In a further embodiment, methods of treating cells 
5 or individuals with breast cancer are provided The method comprises administration of a 
breast cancer inhibitor. 

In one embodiment, a breast cancer inhibitor is an antibody as discussed 
above. In another embodiment, the breast cancer inhibitor is an antisense molecule. 

A variety of cell growth, proliferation, and metastasis assays are known to 
10 those of skill in the art, as described below. 

Soft agar growth or colony formation in suspension 

Normal cells require a solid substrate to attach and grow. When the cells are 
transformed, they lose this phenotype and grow detached from the substrate. For example, 
transformed cells can grow in stirred suspension culture or suspended in semi-solid media, 
1 5 such as semi-solid or soft agar. The transformed cells, when transfected with tumor 

suppressor genes, regenerate normal phenotype and require a solid substrate to attach and 
grow. Soft agar growth or colony formation in suspension assays can be used to identify 
modulators of breast cancer sequences, which when expressed in host cells, inhibit abnormal 
cellular proliferation and transformation. A therapeutic compound would reduce or eliminate 
20 the host cells* ability to grow in stirred suspension culture or suspended in semi-solid media, 
such as semi-solid or soft. 

Techniques for soft agar growth or colony formation in suspension assays are 
described in Freshney, Culture of Animal Cells a Manual of Basic Technique (3 rd ed, 1994), 
herein incorporated by reference. See also, the methods section of Garkavtsev et al (1996), 
25 supra, herein incorporated by reference. 

Contact inhibition and density limitation of growth 

Normal cells typically grow in a flat and organized pattern in apetri dish until 
they touch other cells. When the cells touch one another, they are contact inhibited and stop 
growing. When cells are transformed, however, the cells are not contact inhibited and 
30 continue to grow to high densities in disorganized foci. Thus, the transformed cells grow to a 
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higher saturation density than normal cells. This can be detected morphologically by the 
formation of a disoriented monolayer of cells or rounded cells in foci within the regular 
pattern of normal surrounding cells. Alternatively, labeling index with (^-thymidine at 
saturation density can be used to measure density limitation of growth. See Freshney (1994), 
5 supra. The transformed cells, when transfected with tumor suppressor genes, regenerate a 
normal phenotype and become contact inhibited and would grow to a lower density. 

In this assay, labeling index with ( 3 H)-thymidine at saturation density is a 
preferred method of measuring density limitation of growth. Transformed host cells are 
transfected with a breast cancer-associated sequence and are grown for 24 hours at saturation 
1 0 density in non-Hmitirig medium conditions. The percentage of cells labeling with ( 3 H)- 
thymidine is determined autoradiographically. See> Freshney (1994), supra. 

Growth factor or serum dependence 

Transformed cells have a lower serum dependence than their normal 
15 counterparts {see, e.g, Temin, J. Nail Cancer Insti. 37:167-175 (1966); Eagle et al, 1 Exp. 
Med. 131 :836-879 (1970)); Freshney, supra. This is in part due to release of various growth 
factors by the transformed cells. Growth factor or serum dependence of transformed host 
cells can be compared with that of control. 

Tumor specific markers levels 

Tumor cells release an increased amount of certain factors (hereinafter "tumor 
specific markers") than their normal counterparts. For example, plasminogen activator (PA) 
is released from human glioma at a higher level than from normal brain cells (see, e.g., 
Gullino, Angiogenesis, tumor vascularization, and potential interference with tumor growth, 
in Biological Responses in Cancer, pp. 178-184 (Mihich (ed.) 1985)). Similarly, Tumor 
angiogenesis factor (TAF) is released at a higher level in tumor cells than their normal 
counterparts. See, e.g., Folkman, Angiogenesis and Cancer, Sem Cancer Biol (1992)). 

Various techniques which measure the release of these factors are described in 
Freshney (1994), supra. Also, see, Unkless etal ,J. Biol Chem. 249:4295-4305 (1974); 
Strickland & Beers, J.Biol Chem. 251:5694-5702 (1976); Wmretal, Br. J. Cancer 42:305- 
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312 (1980); Gullino, Angiogenesis, tumor vascularization, and potential interference with 
tumor growth in Biological Responses in Cancer, pp. 178-184 (Mihich (ed.) 1985); 
Freshney Anticancer Res. 5:111-130(1985). 

5 Invasiveness into Matrigel 

The degree of invasiveness into Matrigel or some other extracellular matrix 
constituent can be used as an assay to identify compounds that modulate breast cancer- 
associated sequences. Tumor cells exhibit a good correlation between malignancy and 
invasiveness of cells into Matrigel or some other extracellular matrix constituent In this 
1 0 assay, tumorigenic cells are typically used as host cells. Expression of a tumor suppressor 
gene in these host cells would decrease invasiveness of the host cells. 

Techniques described in Freshney (1994), supra, can be used. Briefly, the 
level of invasion of host cells can be measured by using filters coated with Matrigel or some 
other extracellular matrix constituent Penetration into the gel, or through to the distal side of 
15 the filter, is rated as invasiveness, and rated histologically by number of cells and distance 

moved, or by prelabeling the cells with l25 I and counting the radioactivity on the distal side of 
the filter or bottom of the dish. See, e.g, Freshney (1 984), supra. 

Tumor growth in vivo 

20 Effects of breast cancer-associated sequences on cell growth can be tested in 

transgenic or immune-suppressed mice. Knock-out transgenic mice can be made, in which 
the breast cancer gene is disrupted or in which a breast cancer gene is inserted. Knock-out 
transgenic mice can be made by insertion of a marker gene or other heterologous gene into 
the endogenous breast cancer gene site in the mouse genome via homologous recombination. 

25 Such mice can also be made by substituting the endogenous breast cancer gene with a 

mutated version of the breast cancer gene, or by mutating the endogenous breast cancer gene, 
e.g., by exposure to carcinogens. 

A DNA construct is introduced into the nuclei of embryonic stem cells. Cells 
containing the newly engineered genetic lesion are injected into a host mouse embryo, which 

30 is re-implanted into a recipient female. Some of these embryos develop into chimeric mice 
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that possess germ cells partially derived from the mutant cell line. Therefore, by breeding the 
chimeric mice it is possible to obtain a new line of mice containing the introduced genetic 
lesion (see, e.g, Capecchi et al, Science 244:1288 (1989)). Chimeric targeted mice can be 
derived according to Hogan et al. , Manipulating the Mouse Embryo: A Laboratory Manual, 
5 Cold Spring Harbor Laboratory (1988) and Teratocarcinomas and Embryonic Stem Cells: A . 
Practical Approach, Robertson, ed., IRL Press, Washington, D.C., (1987). 

Alternatively, various immune-suppressed or immune-deficient host animals 
can be used. For example, genetically athymic "nude" mouse (see, e.g, Giovanella et al 9 J. 
Natl Cancer Inst 52:921 (1974)), a SCID mouse, a thymectomized mouse, or an irradiated 

10 mouse (see, e.g, Bradley et al, Br J. Cancer 38:263 (1978); Selby et al, Br. J. Cancer 
41 :52 (1980)) can be used as a host. Transplantable tumor cells (typically about 10 6 cells) 
injected into isogenic hosts will produce invasive tumors in a high proportions of cases, while 
normal cells of similar origin will not In hosts which developed invasive tumors, cells 
expressing a breast cancer-associated sequences are injected subcutaneously. After a suitable 

15 length of time, preferably 4-8 weeks, tumor growth is measured (e.g., by volume or by its two 
largest dimensions) and compared to the control. Tumors that have statistically significant 
reduction (using, e.g., Student's T test) are said to have inhibited growth. 



Polynucleotide modulators of breast cancer 

20 Antisense Polynucleotides 

In certain embodiments, the activity of a breast cancer-associated protein is 
down-regulated, or entirely inhibited, by the use of antisense polynucleotide, i.e., a nucleic 
acid complementary to, and which can preferably hybridize specifically to, a coding mRNA 
nucleic acid sequence, e.g., a breast cancer protein mRNA, or a subsequence thereof 

25 Binding of the antisense polynucleotide to the mRNA reduces the translation and/or stability 
of the mRNA. 

In the context of this invention, antisense polynucleotides can comprise 
naturally-occurring nucleotides, or synthetic species formed from naturally-occuiring 
subunits or their close homologs. Antisense polynucleotides may also have altered sugar 
30 moieties or inter-sugar linkages. Exemplary among these are the phosphorothioate and other 
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sulfur containing species which are known for use in the art Analogs are comprehended by 
this invention so long as they function effectively to hybridize with the breast cancer protein 
mRNA. See, e.g., Isis Pharmaceuticals, Carlsbad, CA; Sequitor, Inc., Natick, MA. 

Such antisense polynucleotides can readily be synthesized using recombinant 
5 means, or can be synthesized in vitro. Equipment for such synthesis is sold by several 
vendors, including Applied Biosystems. The preparation of other oligonucleotides such as 
phosphorothioates and alkylated derivatives is also well known to those of skill in the art. 

Antisense molecules as used herein include antisense or sense 
oligonucleotides. Sense oligonucleotides can, e.g., be employed to block transcription by 

10 binding to the anti-sense strand The antisense and sense oligonucleotide comprise a single- 
stranded nucleic acid sequence (either RNA or DNA) capable of binding to target mRNA 
(sense) or DNA (antisense) sequences for breast cancer molecules. A preferred antisense 
molecule is for a breast cancer sequences in Tables 1-25, or for a ligand or activator thereof 
Antisense or sense oligonucleotides, according to the present invention, comprise a fragment 

15 generally at least about 14 nucleotides, preferably from about 14 to 30 nucleotides. The 
ability to derive an antisense or a sense oligonucleotide, based upon a cDNA sequence 
encoding a given protein is described in, e.g., Stein & Cohen (Cancer Res. 48:2659 (1988 and 
van der Krol et al (BioTechniques 6:958 (1988)). 

20 Ribozymes 

In addition to antisense polynucleotides, ribozymes can be used to target and 
inhibit transcription of breast cancer-associated nucleotide sequences. A ribozyme is an 
RNA molecule that catalytically cleaves other RNA molecules. Different kinds of ribozymes 
have been described, including group I ribozymes, hammerhead ribozymes, hairpin 

25 ribozymes, RNase P, and axhead ribozymes (see, e.g. g Castanotto et al., Adv. in 

Pharmacology 25: 289-3 17 (1994) for a general review of the properties of different 
ribozymes). 

The general features of hairpin ribozymes are described, e.g., in Hampel et al, 
NucL Acids Res. 18:299-304 (1990); European Patent Publication No. 0 360 257; U.S. Patent 
30 No. 5,254,678. Methods of preparing are well known to those of skill in the art (see, e.g, 
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WO 94/26877; Ojwang et al 9 Proc. Natl Acad. Set USA 90:6340-6344 (1993); Yamada et 
a/., Human Gene Therapy 1:39-45 (1994); Leavitt et al 9 Proa Natl Acad. Sci. USA 92:699- 
703 (1995); Leavitt et al 9 Human Gene Therapy 5: 1 151-120 (1994); and Yamada et al, 
Virology 205: 121-126 (1994)). 
5 Polynucleotide modulators of breast cancer may be introduced into a cell 

containing the target nucleotide sequence by formation of a conjugate with a ligand binding 
molecule, as described in WO 91/04753. Suitable ligand binding molecules include, but are 
not limited to, cell surface receptors, growth factors, other cytokines, or other ligands that 
bind to cell surface receptors. Preferably, conjugation of the ligand binding molecule does 

1 0 not substantially interfere with the ability of the ligand binding molecule to bind to its 

corresponding molecule or receptor, or block entry of the sense or antisense oligonucleotide 
or its conjugated version into the cell. Alternatively, a polynucleotide modulator of breast 
cancer may be introduced into a cell containing the target nucleic acid sequence, e.g., by 
formation of an polynucleotide-lipid complex, as described in WO 90/10448. It is 

1 5 understood that the use of antisense molecules or knock out and knock in models may also be 
used in screening assays as discussed above, in addition to methods of treatment 

Thus, in one embodiment, methods of modulating breast cancer in cells or 
organisms are provided. In one embodiment, the methods comprise administering to a cell an 
anti-breast cancer antibody that reduces or eliminates the biological activity of an endogenous 

20 breast cancer protein. Alternatively, the methods comprise administering to a cell or 
organism a recombinant nucleic acid encoding a breast cancer protein. This' may be 
accomplished in any number of ways. In a preferred embodiment, e.g. when the breast 
cancer sequence is down-regulated in breast cancer, such state may be reversed by increasing 
the amount of breast cancer gene product in the cell. This can be accomplished, e.g., by 

25 overexpressing the endogenous breast cancer gene or administering a gene encoding the 
breast cancer sequence, using known gene-therapy techniques, e.g.. In a preferred 
embodiment, the gene therapy techniques include the incorporation of the exogenous gene 
using enhanced homologous recombination (EHR), e.g. as described in PCT/US93/03868, 
hereby incorporated by reference in its entirety. Alternatively, e.g. when the breast cancer 
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sequence is up-regulated in breast cancer, the activity of the endogenous breast cancer gene is 
decreased, e.g. by the administration of abreast cancer antisense nucleic acid 

In one embodiment, the breast cancer proteins of the present invention may be 
used to generate polyclonal and monoclonal antibodies to breast cancer proteins. Similarly, 
5 the breast cancer proteins can be coupled, using standard technology, to affinity 
chromatography columns. These columns may then be used to purify breast cancer 
antibodies useful for production, diagnostic, or therapeutic purposes. In a preferred 
embodiment, the antibodies are generated to epitopes unique to a breast cancer protein; that 
is, the antibodies show little or no cross-reactivity to other proteins. The breast cancer 
10 antibodies may be coupled to standard affinity chromatography columns and used to purify 
breast cancer proteins. The antibodies may also be used as blocking polypeptides, as outlined 
above, since they will specifically bind to the breast cancer protein. 

Methods of identifying variant breast cancer-associated sequences 

1 5 Without being bound by theory, expression of various breast cancer sequences 

is correlated with breast cancer. Accordingly, disorders based on mutant or variant breast 
cancer genes may be determined. In one embodiment, die invention provides methods for 
identifying cells containing variant breast cancer genes, e.g., determining all or part of the 
sequence of at least one endogenous breast cancer genes in a cell. This may be accomplished 

20 using any number of sequencing techniques. In a preferred embodiment, the invention 

provides methods of identifying the breast cancer genotype of an individual, e.g., determining 
all or part of the sequence of at least one breast cancer gene of the individual. This is 
generally done in at least one tissue of the individual, and may include the evaluation of a 
number of tissues or different samples of the same tissue. The method may include 

25 comparing the sequence of the sequenced breast cancer gene to a known breast cancer gene, 
i.e., a wild-type gene. 

The sequence of all or part of the breast cancer gene can then be compared to 
the sequence of a known breast cancer gene to determine if any differences exist This can be 
done using any number of known homology programs, such as Bestfit, etc. In a preferred 

30 embodiment, the presence of a difference in the sequence between die breast cancer gene of 
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the patient and the known breast cancer gene correlates with a disease state or a propensity 
for a disease state, as outlined herein. 

In a preferred embodiment, the breast cancer genes are used as probes to 
determine the number of copies of the breast cancer gene in the genome. 
5 In another preferred embodiment, the breast cancer genes are used as probes to 

determine the chromosomal localization of the breast cancer genes. Information such as 
chromosomal localization finds use in providing a diagnosis or prognosis in particular when 
chromosomal abnormalities such as translocations, and the like are identified in the breast 
cancer gene locus. 

10 

Administration of pharmaceutical and vaccine compositions 

In one embodiment, a therapeutically effective dose of a breast cancer protein 
or modulator thereof is administered to a patient By "therapeutically effective dose" herein 
is meant a dose that produces effects for which it is administered. The exact dose will depend 

15 on tfie purpose of the treatment, and will be ascertainable by one skilled in the art using 
known techniques (e.g., Ansel et al , Pharmaceutical Dosage Forms and Drug Delivery, 
Lieberman, Pharmaceutical Dosage Forms (vols. 1-3, 1992), Dekker, ISBN 0824770846, 
082476918X, 0824712692, 0824716981; Lloyd, The Art, Science and Technology of 
Pharmaceutical Compounding (1999); and Pickar, Dosage Calculations (1999)). As is 

20 known in the art, adjustments for breast cancer degradation, systemic versus localized 

delivery, and rate of new protease synthesis, as well as the age, body weight, general health, 
sex, diet, time of administration, drug interaction and the severity of the condition may be 
necessary, and will be ascertainable with routine experimentation by those skilled in the art 
U.S. Patent Application N. 09/687,576, further discloses the use of compositions and 

25 methods of diagnosis and treatment in breast cancer is hereby expressly incorporated by 
reference. 

A "patient" for the purposes of the present invention includes both humans 
and other animals, particularly mammals. Thus the methods are applicable to both human 
therapy and veterinary applications. In the preferred embodiment the patient is a mammal, 
30 preferably a primate, and in the most preferred embodiment the patient is human. 
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The administration of the breast cancer proteins and modulators thereof of the 
present invention can be done in a variety of ways as discussed above, including, but not 
limited to, orally, suhcutaneously, intravenously, intranasally, transdermally, 
intraperitoneally, intramuscularly, intrapulmonary, vaginally, rectally, or intraocularly. In 
5 some instances, e.g., in the treatment of wounds and inflammation, the breast cancer proteins 
and modulators may be directly applied as a solution or spray. 

The pharmaceutical compositions of the present invention comprise a breast 
cancer protein in a form suitable for administration to a patient In the preferred embodiment, 
the pharmaceutical compositions are in a water soluble form, such as being present as 

10 pharmaceutically acceptable salts, which is meant to include both acid and base addition 
salts. "Pharmaceutically acceptable acid addition salt" refers to those salts that retain the 
biological effectiveness of the free bases and that are not biologically or otherwise 
undesirable, formed with inorganic acids such as hydrochloric acid, hydrobromic acid, 
sulfuric acid, nitric acid, phosphoric acid and the like, and organic acids such as acetic acid, 

1 5 propionic acid, glycolic acid, pyruvic acid, oxalic acid, maleic acid, malonic acid, succinic 
acid, fomaric acid, tartaric acid, citric acid, benzoic acid, cinnamic acid, mandelic acid, 
methanesulfonic acid, ethanesulfonic acid, p-toluenesulfonic acid, salicylic add and the like. 
'Thannaceutically acceptable base addition salts" include those derived from inorganic bases 
such as sodium, potassium, lithium, ammonium, calcium, magnesium, iron, zinc, copper, 

20 manganese, aluminum salts and the like. Particularly preferred are the ammonium, 

potassium, sodium, calcium, and magnesium salts. Salts derived from pharmaceutically 
acceptable organic non-toxic bases include salts of primary, secondary, and tertiary amines, 
substituted amines including naturally occurring substituted amines, cyclic amines and basic 
ion exchange resins, such as isopropylamine, trimethylamine, diethylamine, triethylamine, 

25 tripropylamine, and ethanolamine. 

The pharmaceutical compositions may also include one or more of the 
following: carrier proteins such as serum albumin; buffers; fillers such as microcrystalline 
cellulose, lactose, com and other starches; binding agents; sweeteners and other flavoring 
agents; coloring agents; and polyethylene glycol. 
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The pharmaceutical compositions can be administered in a variety of unit 
dosage forms depending upon the method of administration. For example, unit dosage forms 
suitable for oral administration include, but are not limited to, powder, tablets, pills, capsules 
and lozenges. It is recognized that breast cancer protein modulators (e.g., antibodies, 
5 antisense constructs, ribozymes, small organic molecules, etc.) when administered orally, 
should be protected from digestion. This is typically accomplished either by complexing the 
molecule(s) with a composition to render it resistant to acidic and enzymatic hydrolysis, or by 
packaging the molecule(s) in an appropriately resistanf carrier, such as a liposome or a . 
protection barrier. Means of protecting agents ftom digestion are well known in the art 

1 o The compositions for administration will commonly comprise a breast cancer 

protein modulator dissolved in a pharmaceutically acceptable carrier, preferably an aqueous 
carrier. A variety of aqueous carriers can be used, e.g., buffered saline and the like. These 
solutions are sterile and generally free of undesirable matter. These compositions may be 
sterilized by conventional, well known sterilization techniques. The compositions may 

1 S contain pharmaceutically acceptable auxiliary substances as required to approximate 

physiological conditions such as pH adjusting and buffering agents, toxicity adjusting agents 
and the like, e.g., sodium acetate, sodium chloride, potassium chloride, calcium chloride, 
sodium lactate and the like. The concentration of active agent in these formulations can vary 
widely, and will be selected primarily based on fluid volumes, viscosities, body weight and 

20 the like in accordance with the particular mode of administration selected and the patient's 
needs (e.g., Remington *s Pharmaceutical Science (15th ed, 1980) and Goodman & Gilhnan, 
The Pharmacologial Basis of Therapeutics (Hardman et o/.,eds., 1996)). 

Thus, a typical pharmaceutical composition for intravenous administration 
would be about 0.1 to 10 mg per patient per day. Dosages from 0.1 up to about 100 mg per 

25 patient per day may be used, particularly when the drug is administered to a secluded site and 
not into the blood stream, such as into a body cavity or into a lumen of an organ. 
Substantially higher dosages are possible in topical administration. Actual methods for 
preparing parenterally administrable compositions will be known or apparent to those skilled 
in the art, e.g., Remington 's Pharmaceutical Science and Goodman and Gilhnan, The 

30 Pharmacologial Basis oj ~ Therapeutics* supra. 
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The compositions containing modulators of breast cancer proteins can be 
administered for therapeutic or prophylactic treatments. In therapeutic applications, 
compositions are administered to a patient suffering from a disease (e.g., a cancer) in an 
amount sufficient to cure or at least partially arrest the disease and its complications. An 
5 amount adequate to accomplish this is defined as a 'therapeutically effective dose." Amounts 
effective for this use will depend upon the severity of the disease and the general state of the 
patient's health. Single or multiple administrations of the compositions may be administered 
depending on the dosage and frequency as required and tolerated by the patient In any event, 
the composition should provide a sufficient quantity of the agents of this invention to 

10 effectively treat the patient An amount of modulator that is capable of preventing or slowing 
the development of cancer in a mammal is referred to as a '*prophylacticaily effective dose." 
The particular dose required for a prophylactic treatment will depend upon the medical 
condition and history of the mammal, the particular cancer being prevented, as well as other 
factors such as age, weight, gender, administration route, efficiency, etc. Such prophylactic 

1 5 treatments may be used, e.g., in a mammal who has previously had cancer to prevent a 

recurrence of the cancer, or in a mammal who is suspected of having a significant likelihood 
of developing cancer. 

It will be appreciated that the present breast cancer protein-modulating 
compounds can be administered alone or in combination with additional breast cancer 

20 modulating compounds or with other therapeutic agent, eg., other anti-cancer agents or 
treatments. 

In numerous embodiments, one or more nucleic acids, e.g., polynucleotides 
comprising nucleic acid sequences set forth in Tables 1-25, such as antisense polynucleotides 
or ribozymes, will be introduced into cells, in vitro or in vivo. The present invention provides 
25 methods, reagents, vectors, and cells useful for expression of breast cancer-associated 
polypeptides and nucleic acids using in vitro (cell-free), ex vivo or in vivo (cell or 
organism-based) recombinant expression systems. 

The particular procedure used to introduce the nucleic acids into a host cell for 
expression of a protein or nucleic acid is application specific. Many procedures for 
30 introducing foreign nucleotide sequences into host cells may be used. These include the use 
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of calcium phosphate transfection, spheroplasts, electroporation, liposomes, microinjection, 
plasma vectors, viral vectors and any of the other well known methods for introducing cloned 
genomic DNA, cDNA, synthetic DNA or other foreign genetic material into a host cell {see, 
e.g. t Berger & Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology 
5 volume 1 52 (Berger), Ausubel et al , eds., Current Protocols (supplemented through 1999), 
and Sambrook et al. 9 Molecular Cloning - A Laboratory Manual (2nd ed., Vol. 1-3, 1989. 

In a preferred embodiment, breast cancer proteins and modulators are 
administered as therapeutic agents, and can be formulated as outlined above. Similarly, 
breast cancer genes (including both the full-length sequence, partial sequences, or regulatory 

1 0 sequences of the breast cancer coding regions) can be administered in a gene therapy 
application. These breast cancer genes can include antisense applications, either as gene 
therapy (i.e. for incorporation into the genome) or as antisense compositions, as will be 
appreciated by those in the art 

Breast cancer polypeptides and polynucleotides can also be administered as 

1 5 vaccine compositions to stimulate HTL, CTL and antibody responses.. Such vaccine 

compositions can include, e.g., lipidated peptides (see, e.g;,Vitiello, A. et al. 9 J. Clin. Invest 
95:341 (1995)), peptide compositions encapsulated in poly(DL-lactide-co-glycolide) (*TLG") 
microspheres (see, eg., Eldridge, et al, Molec. Immunol 28:287-294, (1991); Alonso et al 9 
Vaccine 12:299-306 (1994); Jones et al. 9 Vaccine 13:675-681 (1995)), peptide compositions 

20 contained in immune stimulating complexes (ISCOMS) (see, e.g., Takahashi et al. 9 Nature 
344:873-875 (1990); Hu et al 9 Clin Exp Immunol 113:235-243 (1998)), multiple antigen 
peptide systems (MAPs) (see, e.g. 9 Tam, Proc. Natl Acad Sci. U.S.A. 85:5409-5413 (1988); 
Tarn, J. Immunol Methods 196:17-32 (1996)), peptides formulated as multivalent peptides; 
peptides for use in ballistic delivery systems, typically crystallized peptides, viral delivery 

25 vectors (Pedcus, et al. 9 In: Concepts in vaccine development (Kaufinann, ed., p. 379, 1996); 
Chakrabarti, et al. 9 Nature 320:535 (1986); Hu et al 9 Nature 320:537 (1986); Kieny, et al. 9 
AIDS Bio/Technology 4:790 (1986); Top et al 9 J. Infect. Dis. 124:148 (1971); Chanda et al 9 
Virology 175:535 (1990)), particles of viral or synthetic origin (see, eg. 9 Kofler et al 9 J. 
Immunol Methods. 192:25 (1996); Eldridge et al 9 Sem. Hematol 30:16 (1993); Falo et al 9 

30 Nature Med 7:649 (1995)), adjuvants (Wairen et al 9 Annu. Rev. Immunol. 4:369 (1986); 
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Gupta et al, Vaccine 11:293 (1993)), liposomes (Reddy etal, J.Immunol 148:1585 (1992); 
Rock, Immunol Today 17:131 (1996)), or, naked or particle absorbed cDNA (Ulmer, et al, 
Science 259:1745 (1993); Robinson et al y Vaccine 11:957(1993); Shiver al y In: Concepts 
in vaccine development (Kaufinann, ed., p. 423, 1996); Cease & Berzofsky, Anntu Rev. 
5 Immunol 12:923 (1994) and Eldridge et al, Sem. Hematol 30:16 (1993)). Toxin-targeted 
delivery technologies, also known as receptor mediated targeting, such as those of Avant 
Immunotherapeutics, Inc. (Needham, Massachusetts) may also be used. 

Vaccine compositions often include adjuvants. Many adjuvants contain a 
substance designed to protect the antigen from rapid catabolism, such as aluminum hydroxide 

10 or mineral oil, and a stimulator of immune responses, such as lipid A, Bortadella pertussis or 
Mycobacterium tuberculosis derived proteins. Certain adjuvants are commercially available 
as, e.g., Freund's Incomplete Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit, 
MI); Merck Adjuvant 65 (Merck and Company, Inc., Rahway, NT); AS-2 (SmithKline 
Beecham, Philadelphia, PA); aluminum salts such as aluminum hydroxide gel (alum) or 

15 aluminum phosphate; salts of calcium, iron or zinc; an insoluble suspension of acylated 
tyrosine; acylated sugars; cationically or anionically derivatized polysaccharides; 
polyphosphazenes; biodegradable microspheres; monophosphoryl lipid A and quil A. 
Cytokines, such as GM-CSF, interleukin-2, -7, -12, and other like growth factors, may also be 
used as adjuvants. 

20 Vaccines can be administered as nucleic acj^d compositions wherein DNA or 

RNA encoding one or more of the polypeptides, or a fragment thereof^ is administered to a 
patient This approach is described, for instance, in Wolff aL, Science 247:1465 (1990) as 
well as U.S. Patent Nos. 5,580,859; 5,589,466; 5,804,566; 5,739,118; 5,736,524; 5,679,647; 
WO 98/04720; and in more detail below. Examples of DNA-based delivery technologies 

25 include "naked DNA", facilitated (bupivicaine, polymers, peptide-mediated) delivery, 

cationic lipid complexes, and particle-mediated Q'geno gun") or pressure-mediated delivery 
(see, e.g., U.S. PatentNo. 5,922,687). 

For therapeutic or prophylactic immunization purposes, the peptides of the 
invention can be expressed by viral or bacterial vectors. Examples of expression vectors 

30 include attenuated viral hosts, such as vaccinia or fowlpox. This approach involves the use of 
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vaccinia virus, e.g., as a vector to express nucleotide sequences that encode breast cancer 
polypeptides or polypeptide fragments. Upon introduction into a host, the recombinant 
vaccinia virus expresses the immunogenic peptide, and thereby elicits an immune response. 
Vaccinia vectors and methods useful in immunization protocols are described in, e.g., U.S. 
5 Patent No. 4,722,848. Another vector is BCG (Bacille Calmette Guerin). BCG vectors are 
described in Stover et al. 9 Nature 351:456-460 (1991). A wide variety of other vectors useful 
for therapeutic administration or immunization e.g. adeno and adeno-associaied virus vectors, 
retroviral vectors, Salmonella typhi vectors, detoxified anthrax toxin vectors, and the like, 
will be apparent to those skilled in the art from the description herein (see, e.g., Shata et aL, 

10 Mol Med Today 6:66-71 (2000); Shedlock et al, JLeukoc 5^/68:793-806 (2000); Hipp et 
aljn Vivo 14:571-85 (2000)). 

Methods for the use of genes as DNA vaccines are well known, and include 
placing a breast cancer gene or portion of a breast cancer gene under the control of a 
regulatable promoter or a tissue-specific promoter for expression in a breast cancer patient. 

15 The breast cancer gene used for DNA vaccines can encode full-length breast cancer proteins, 
but more preferably encodes portions of the breast cancer proteins including peptides derived 
from the breast cancer protein. In one embodiment, a patient is immunized with a DNA 
vaccine comprising a plurality of nucleotide sequences derived from a breast cancer gene. For 
example, breast cancer-associated genes or sequence encoding subfragments of a breast 

20 cancer protein are introduced into expression vectors and tested for their immunogenicity in 
the context of Class I MHC and an ability to generate cytotoxic T cell responses. This 
procedure provides for production of cytotoxic T cell responses against cells which present 
antigen, including intracellular epitopes. 

In a preferred embodiment, the DNA vaccines include a gene encoding an 

25 adjuvant molecule with the DNA vaccine. Such adjuvant molecules include cytokines that 
increase the immunogenic response to the breast cancer polypeptide encoded by the DNA 
vaccine. Additional or alternative adjuvants are available. 

In another preferred embodiment breast cancer genes find use in generating 
animal models of breast cancer. When the breast cancer gene identified is repressed or 

30 diminished in cancer tissue, gene therapy technology, e.g., wherein antisense RNA directed 
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to the breast cancer gene will also diminish or repress expression of the gene. Animal models 
of breast cancer find use in screening for modulators of a breast cancer-associated sequence 
or modulators of breast cancer. Similarly, transgenic animal technology including gene 
knockout technology, e.g. as a result of homologous recombination with an appropriate gene 
5 targeting vector, will result in the absence or increased expression of the breast cancer 
protein. When desired, tissue-specific expression or knockout of the breast cancer protein 
may be necessary. 

It is also possible that the breast cancer protein is overexpressed in breast 
cancer. As such, transgenic animals can be generated that overexpress the breast cancer 
10 protein- Depending on the desired expression level, promoters of various strengths can be 
employed to express the transgene. Also, the number of copies of the integrated transgene 
can be determined and compared for a determination of the expression level of the transgene. 
Animals generated by such methods find use as animal models of breast cancer and are 
additionally useful in screening for modulators to treat breast cancer. 

15 

Kits for Use in Diagnostic and/or Prognostic Applications 

For use in diagnostic, research, and therapeutic applications suggested above, 
kits are also provided by the invention. In the diagnostic and research applications such kits 
may include any or all of the following: assay reagents, buffers, breast cancer-specific nucleic 

20 acids or antibodies, hybridization probes and/or primers, antisense polynucleotides, 

ribozymes, dominant negative breast cancer polypeptides or polynucleotides, small molecules 
inhibitors of breast cancer-associated sequences etc, A therapeutic product may include 
sterile saline or another phannaceutically acceptable emulsion and suspension base. 

In addition, the kits may include instructional materials containing directions 

25. (i.e., protocols) for the practice of the methods of this invention. While the instructional 
materials typically comprise written or printed materials they are not limited to such. Any 
medium capable of storing such instructions and communicating them to an end user is 
contemplated by this invention. Such media include, but are not limited to electronic storage 
media (e.g., magnetic discs, tapes, cartridges, chips), optical media (e.g., CD ROM), and the 
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like. Such media may include addresses to internet sites that provide such instructional 
materials. 

The present invention also provides for kits for screening for modulators of 
breast cancer-associated sequences. Such kits can be prepared from readily available 

5 materials and reagents. For example, such kits can comprise one or more of the following 
materials: a breast cancer-associated polypeptide or polynucleotide, reaction tubes, and 
instructions for testing breast cancer-associated activity. Optionally, the kit contains 
biologically active breast cancer protein. A wide variety of kits and components can be 
prepared according to the present invention, depending upon the intended user of the kit and 

10 the particular needs of the user. Diagnosis would typically involve evaluation of a plurality 
of genes or products. The genes will be selected based on correlations with important 
parameters in disease which may be identified in historical or outcome data. 



15 EXAMPLES 

Example 1 : Tissue Preparation, Labeling Chips, and Fingerprints 

Purifying total RNA from tissue sample using TRIzol Reagent 

20 The sample weight is first estimated. The tissue samples are homogenized in 

1 ml of TRIzol per 50 mg of tissue using a homogenizer (e.g., Polytron 3100). The size of 
the generator/probe used depends upon die sample amount A generator that is too large for 
the amount of tissue to be homogenized will cause a loss of sample and lower RNA yield A 
larger generator (e.g., 20 mm) is suitable for tissue samples weighing more than 0.6 g. Fill 

25 tubes should not be overfilled. If the working volume is greater than 2 ml and no greater than 
10 ml, a 15 ml polypropylene tube (Falcon 2059) is suitable for homogenization. 

Tissues should be kept frozen until homogenized. The TRIzol is added 
directly to the frozen tissue before homogenization. Following homogenization, the insoluble 
material is removed from the homogenate by centrifugation at 7500 x g for 15 min. in a 

30 Sorvall superspeed or 12,000 x g for 10 min. in an Eppendorf centrifuge at 4°C. The cleared 
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homogenate is then transferred to a new tube(s). Samples may be frozen and stored at -60 to 

-70°C for at least one month or else continue with Ihe purification. 

The next process is phase separation. The homogenized samples are incubated 
for 5 minutes at room temperature. Then, 0.2 ml of chloroform per 1ml ofTRIzol reagent is 
5 added to the homogenization mixture. The tubes are securely capped and shaken vigorously 
by hand (do not vortex) for 15 seconds. The samples are then incubated at room temp, for 
2-3 minutes and next centrifuged at 6500 rpm in a Sorvall superspeed for 30 min. at 4oC. 

The next process is RNA Precipitation. The aqueous phase is transferred to a 
fresh tube. The organic phase can be saved if isolation of DNA or protein is desired. Then 
10 0.5 ml of isopropyl alcohol is added per 1ml ofTRIzol reagent used in the original 

homogenization. Then, the tubes are securely capped and inverted to mix. The samples are 
then incubated at room temp, for 10 minutes an centrifuged at 6500 rpm in Sorvall for 20 

min.at4°C. 

The RNA is then washed. The supernatant is poured off and the pellet washed 
15 with cold 75% ethanoL 1 ml of 75% ethanol is used per 1 ml of the TRIzol reagent used in 
the initial homogenization. The tubes are capped securely and inverted several times to 
loosen pellet without vortexing. They are next centrifuged at <8000 rpm(<7500xg) for 5 
minutes at 4°C. 

The RNA wash is decanted. The pellet is carefully transferred to an 
20 Eppendorf tube (sliding down the tube into the new tube by use of a pipet tip to help guide it 
in if necessary). Tube(s) sizes for precipitating the RNA depending on the working volumes. 
Larger tubes may take too long to dry. Dry pellet. The RNA is then resuspended in an 
appropriate volume (e.g., 2 -5 ug/ul) of DEPC H 2 0. The absorbance is then measured. 

The poly A+ mRNA may next be purified from total RNA by other methods 
25 such as Qiagen* sRNeasykit The poly A + mRNA is purified from total RNA by adding the 
oligotex suspension which has been heated to 37°C and mixing prior to adding to RNA. 
The Elution Buffer is incubated at 70°C. If there is precipitate in the buffer, warm up the 2 x 
Binding Buffer at 65°C. The the total RNA is mixed with DEPOtreated water, 2 x Binding 
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Buffer, and Oligotex according to Table 2 on page 16 of the Oligotex Handbook and next 

incubated for 3 minutes at 65°C and 10 minutes at room temperature. 

The preparation is centrifiiged for 2 minutes at 14,000 to 18,000 g, preferably, 
at a "soft setting," The supernatant is removed without disturbing Oligotex pellet A little bit 
5 of solution can be left behind to reduce the loss of Oligotex. The supernatant is saved until 

satisfactory binding and elution of poly A + mRNA has been found. 

Then, the preparation is gently resusp ended in Wash Buffer OW2 and pipetted 

onto the spin column and centrifuged at full speed (soft setting if possible) for 1 minute. 

Next, the spin column is transferred to a new collection tube and gently 
10 resuspended in Wash Buffer OW2 and centrifuged as described herein. 

Then, the spin column is transferred to anew tube and eluted with 20 to 100 ul 

of preheated (70°C) Elution Buffer. The Oligotex resin is gently resuspended by pipetting up 

and down. The centrifugation is repeated as above and the elution repeated with fresh elution 

buffer or first eluate to keep the elution volume low. 
15 The absorbance is next read to determine the yield, using diluted Elution * 

Buffer as the blank. 

Before proceeding with cDNA synthesis, the mRNA is precipitated before 

proceeding with cDNA synthesis, as components leftover or in the Elution Buffer from the 

Oligotex purification procedure will inhibit downstream enzymatic reactions of the mRNA. 
20 0.4 vol. of 7.5 M NH40Ac + 2.5 vol. of cold 100% etharill is added and the preparation 

precipitated at -20°C 1 hour to overnight (or 20-30 min. at -70°C), and centrifuged at 

14,000-16,000 x g for 30 minutes at 4°C. Next, the pellet is washed with 0.5 ml of 80% 

ethanol (-20°Q and then centrifuged at 14,000-16,000 x g for 5 minutes at room temperature. 

The80% ethanol wash is then repeated. The last bit of ethanol from the pellet is then dried 
25 without use of a speed vacuum and the pellet is then resuspended in DEPC H2O at lug/ul 

concentration. 

Alternatively the RNA mav be purified using other methods fe.p.. Qiagen's RNeasv kit). 
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No more than 100 ug is added to the RNeasy column. The sample volume is 
adjusted to 100 ul with RNase-free water. 350 ul Buffer RLT and then 250 ul ethanol 
(100%) are added to the sample. The preparation is then mixed by pipetting and applied to an 
RNeasy mini spin column for centrifugation (1 5 sec at >1 0,000 rpm). If yield is low, reapply 
5 the flowthrough to the column and centrifuge again. 

Then, transfer column to a new 2 ml collection tube and add 500 ul Buffer 
RPE and centrifiige for 15 sec at >10,000 rpm. The flowthrough is discarded. 500 ul Buffer 
RPE and is then added and the preparation is centriuged for 15 sec at >10,000 rpm. The 
flowthrough is discarded, and the column membrane dried by centrifiiging for 2 min at 
10 maximum speed. The column is transferred to a new 1 .5-ml collection tube. 30-50 ul of 

RNase-free water is applied directly onto column membrane. The column is then centrifuged 
for 1 min at >1 0,000 rpm and the elution step repeated. 

The absorbance is then read to determine yield. If necessary, the material may 
be ethanol precipitated with ammonium acetate and 2.5X volume 1 00% ethanol. 

15 

First Strand cDNA Synthesis 

The first strand can be make using using Gibco's "Superscript Choice System 
for cDNA Synthesis" kit The starting material is 5 ug of total RNA or lug of polyAf 
mRNAL For total RNA, 2 ul of Superscript RT is used; for polyA+ mRNA, 1 ul of 
20 Superscript RT is used. The final volume of first strand synthesis mix is 20 ul. The RNA 
should be in a volume no greater than 10 ul. The RNA is incubated with 1 ulof lOOpmol 

T7-T24 oligo for 1 0 min at 70°C followed by addition on ice of 7 ul of: 4ul 5X 1 st Strand 
Buffer, 2 ul of 0.1M DTT, and 1 ul of lOmM dNTP mix. The preparation is then incubated at 
37°C for 2 min before addition of the Superscript RT followed by incubation at 37°C for 1 
25 hour. 

Second Strand Synthesis 

For the second strand synthesis, place 1st strand reactions on ice and add; 91 

ul DEPC H 2 0; 30 ul 5X 2nd Strand Buffer; 3 ul lOmM dNTP mix; 1 ul 10 U/ul E.coli DNA 

30 Ligase; 4 ul 10 U/ul E.coli DNA Polymerase; and 1 ul 2 U/ul RNase H. Mix and incubate 2 
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hours at 16°C. Add 2 ul T4 DNA Polymerase. Incubate 5 min at 16°C. Add lOulof 0.5M 
EDTA. 

CleflTiin ff up cDNA 

5 The cDNA is purified using Phenol :Chlorofonn:Isoamyl Alcohol (25:24: 1) 

and Phase-Lock gel tubes. The PLG tubes are centrifuged for 30 sec at maximum speed. 
The cDNA mix is then transferred to PLG tube. An equal volume of 
phenol:chlorofoim:isamyl alcohol is then added, the preparation shaken vigorously (no 
vortexing), and centrifuged for 5 minutes at maximum speed. The top aqueous solution is 
1 0 transferred to a new tube and ethanol precipitated by adding 7.5X 5M NH40Ac and 2.5X 
volume of 100% ethanol. Next, it is centrifuged immediately at room temperature for 20 
min, maximum speed. The supernatant is removed, and the pellet washed with 2X with cold 
80% ethanol. As much ethanol wash as possible should be removed before air drying the 
pellet; and resuspending it in 3 ul RNase-free water. 

15 

In vitro Transcription (TVD and labeling with biotin 

In vitro Transcription (TVT) and labeling with biotin is performed as follows: 
Pipet 1 .5 ul of cDNA into a thin-wall PCR tube. Make NTP labeling mix by combining 2 ul 
17 lOxATP (75 mM) (Ambion); 2 ul T7 lOxGTP (75 mM) (Ambion); 1.5 ul T7 IQxCTP (75 

20 mM) (Ambion); 1.5 ul T7 lOxUTP (75 mM) (Ambion); 3.75 ul 10 mM Bio-ll-UTP 

(Boehringer-Mannheim/Roche or Enzo); 3.75 ul 10 mM Bio-16-CTP (Enzo); 2 ul lOx T7 
transcription buffer (Ambion); and 2 ul lOx T7 enzyme mix (Ambion). The final volume is 
20 ul. Incubate 6 hours at 37°C in a PCR machine. The RNA can be furthered cleaned. 
Clean-up follows the previous instructions for RNeasy columns or Qiagen's KNeasy protocol 

25 handbook. The cRNA often needs to be ethanol precipitated by resuspension in a volume 
compatible with the fragmentation step. 

Fragmentation is performed as follows. 15 ug of labeled RNA is usually 
fragmented. Try to minimize the fragmentation reaction volume; a 1 0 id volume is 
recommended but 20 ul is all right Do not go higher than 20 ul because the magnesium in 

30 the fragmentation buffer contributes to precipitation in the hybridization buffer. Fragment 
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RNA by incubation at 94 C for 35 minutes in 1 x Fragmentation buffer (5 x Fragmentation 
buffer is 200 mM Tris-acetate, pH 8.1; 500 mM KOAc; 150 mM MgOAc). The labeled 
RNA transcript can be analyzed before and after fragmentation. Samples can be heated to 
65°C for 15 minutes and electrophoiesed on 1% agarose/TBE gels to get an approximate idea 
of the transcript size range 

For hybridization, 200 ul (10 ug cRNA) of a hybridization mix is put on the 
chip. If multiple hybridizations are to be done (such as cycling through a 5 chip set), then it 
is recommended that an initial hybridization mix of 300 ul or more be made. The 
hybridization mix is: fragment labeled RNA (50 ng/ul final cone); 50 pM 948-b control 
oligo; 1.5 pM BioB; 5 pM BioC; 25 pM BioD; 100 pM CRE; 0.1 mgfail herring sperm DNA; 
0.5 mg/ml acetylated BSA; and 300 ul with lxMES hyb buffer. 

The hybridization reaction is conducted with non-biotinylated IVT (purified 
by RNeasy columns) (see example 1 for steps from tissue to IVT): The following mixture is 
prepared: 

IVT anti sense RNA; 4 fig: \il 

Random Hexamers (1 ng/jil): 4 jxl 

H 2 0: id 

14 \xl 

Incubate the above 14 \xl mixture at 70°C for 10 min.; then put on ice. 

The Reverse transcription procedure uses the following mixture: 
0.1MDTT: 3 (xl 

SOXdNTPmix: 0.6 pi 

H 2 0: 2.4^1 
(^3orQr5dUTP(lmM): 3 
SS RT II (BRL): 1 jil 

16fil 

The above solution is added to the hybridization reaction and incubated for 30 min., 42°C. 

Then, 1 \il SSII is added and incubated for another hour before being placed on ice. 
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The SOX dNTP mix contains 25mM of cold dATP, dCTP, and dGTP, lOmM 
of dTTP and is made by adding 25 |xl each of lOOmM dATP, dCTP, and dGTP; 10 pi of 
lOOmM dTTP to 15 pi H 2 0. ] 

RNA degradation is performed as follows. Add 86 pi H 2 0, 1 .5 pi 1M NaOH/ 
2 mM EDTA and incubate at 65°C, 10 min.. For U-Con 30, 500 pi IE/sample spin at 7000 g 
for 10 min, save flow through for purification. For Qiagen purification, suspend u-con 
recovered material in 500 pi buffer PB and proceed using Qiagen protocol. For DNAse 
digestion, add 1 ul of 1/100 dilution of DNAse/30 ul Rx and incubate at 37°C for 15 min. 
Incubate at 5 min 95°C to denature the DNAse. 

Sample preparation 

For sample preparation, add Cot-1 DNA, 10 pi; SOX dNTPs, 1 pi; 20X SSC, 2.3 pi; Na pyro 
phosphate, 7.5 pi; 10 mgfrnl Herring sperm DNA; 1 ul of 1/10 dilution to 21.8 final vol. Dry 
in speed vac. Resuspend in 15 pi H 2 0. Add 0.38 pi 10% SDS. Heat 95°C, 2 min and slow 
cool at room temp, for 20 min. Put on slide and hybridize overnight at 64°C. Washing after 
the hybridization: 3X SS(Y0.03% SDS: 2 min., 37.5 ml 20X SSOK).75ml 10% SDS in 
250ml H 2 0; IX SSC: 5 min., 12.5 ml 20X SSC in 250ml H 2 0; 0.2X SSC: 5 min., 2.5 ml 20X 
SSC in 250ml H2O. Dry slides and scan at appropiate PMT's and channels. 
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TABLE 1: Figure 1 from BRCA 001 US 

5 Table 1 shows genes, (incorporated in their entirety here and throughout the application 
where primekeys are provided), downregulated in tumor tissue compared to normal breast 
tissue. 



10 

Pfcey: Unique Eos probesel identifier number 

ExAccn: Exenptar Accessfon nur^ 

UnlgeneJD: IMgene ranter 

UnlgeneTlfle: LWgenegenefitie 

15 R1: Raft) of normal breast tissue to tumor 



Pfcey ExAccn UnigenelD UnfeeneTftte R1 

20 100472 D90084 Hs.1023 pyruvate (tehydrogenase (Bpoarntde) alpha 5 

100499 T51985 Hs383108 herroglobm, gamma G 10 

100545 M55405 gb:Honiosaplerernu*(M^ 5 

100549 BE142019 Hs£22Q56 Horno sapiens cDNAFUl 1572 fis, done HE 10 

100613 X52078 Hs.101047 traitsatpflun tauter 3 (E2A torounogtobul 5 

25 100635 BE258039 Hs.129953 Bwjgsarcona breakpoint region 1 5 

100645 X16841 Hs.167988 neural eel adhesion molecule 1 5 

100654 A03758 NMJM)477^Hornosapfera 10 

100702 L27065 gbifuman neuro&roma^ 5 

100915 M60832 Ha249239 coflagen, type VT1I, alpha 2 5 

30 100971 BE379727 Ha83213 fetly add binding protein 4, adipocyte 10 

101125 AJB50562 H&82749 bare nTBrnbr ar iB4superfam^inBnte 5 

101166 M90424 Ha2099 Ifoccalto 1 feroteto mfcrafing faster m 5 

101184 rM.001674 Hs.460 acfralkigfranscf^ factor 3 10 

101336 NM-006732 Hs.75678 Faimur^osteosarcornav^ciKngeneh 10 

35 101367 XD3350 Hs.4 afccM cWiydrogenase IB (c^ 10 
101447 M21305 gb^urnanajphasatelfiteandsate!fie3 ' 10 

101461 N98569 Hs.76422 prwsphofcase A2, group HA (platelets, 10 

101511 MZ7826 H&267319 ewtogenous retroviral protease 10 

101634 AV65Q2G2 HsJ5765 GRQ2 oncogene 5 

40 101736 M74447 Hs*502 transporter 2, ATP-toKflhg cassette; sub £ 10 

102208 U22961 gbitumaninRNAdMiewffisini^ f 10 

102297 NKL001504 Hs.198252 G protetwoupted receptor 9 5 

102450 U48251 Hs.75871 protein WrtaseCbWing protein 1 10 

1QS15 U89337 Hs.169886 tenascmXB 10 

45 102571 U60115 H&239069 fourandahatfUMcbmalral 5 

102800 AA313538 gb£ST185419 Colon caianorra (HCC) ceO 10 

102857 NKL006744 HsJ6461 retinc*^rQpiDteh4 t lntei3m 10 

102990 AA829286 H&332053 serum amyloid A1 10 

103434 X98065 H&54433 tanascin R (mstneth, Janusm) 5 

50 103747 AA081995 gb2n26d06 jri Stratagene neuroepBneBum 10 

103750 AA126129 gbzrn78c07 j1 Stratagene risurcepifoefium 5 

103812 AA137107 N&326391 Homo sapiens. ckmeR^C^6638. inRNA.com 10 

103851 AA326216 HsJ719 hypome^ protein MGC1 136 5 

104080 AB041036 Hs57771 kaDBcrelri 11 (KLK11; TtSP; PRSSaOchjjpp 5 

55 104093 R507Z7 Ha*36970 ESTs 10 

104106 AM22123 gtxzv26h12j1 Sc»re3jtt«ytPu_S1 Homosapi 5 

104109 AL353957 HsJ84181 riypometiralprotehDKFZp434P0^ 10 

104250 F06638 Hs.12440 Homo sapiens done 24734 rNRNA sequence 10 

104340 AA426189 gfczwIleO&rl SceresJMWPtLSI Hcmosapl 5 

60 104492 N73185 Hsj94285 EST 10 

104506 N91071 Hs.109650 ESTs 10 

104511 N99542 Hs£72 orosornucoM 1 5 

104532 AI498763 K&203013 hypothetical protein FU12748 10 
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104536 R24024 Hs.158101 H(anosaplo»cDNAaJ14673fis.ctaneNr 5 

104572 Y11312 Hs.132463 pht^hoinosffidEsWonase. class 2, beta 5 

104659 AW969769 H3.1Q52D1 ESTs 5 

104677 AA009764 Hs.190380 ESTs 10 

5 104711 AA017245 Hs32794 ESTs 10 

104731 AA0193QD Hs.125070 ESTs, Moderately similar to I54374 gene 10 

104764 AKB9243 Hs-278585 ESTs 5 

105005 A1298208 Hs^8805 ESTs 10 

105036 AA130390 Hs25549 hypo&>eflcaipTotelnFU2D89S 10 

10 105105 R61532 H&87016 hypothefcaJ protein FU22938 5 

105231 AW970O43 H&238039 hypoeiefical protein FU 11030 5 

105239 AA221036 gb2j03H2j1 Stratagene WT2 neuronal pr 10 

105321 AM21973 Hs.169119 ESTs, WeaWy stmBar to T25731 hypofhefi 5 

105957 BE242857 Hs27021 hypdhefcal proteh FU11159 5 

15 106052 N79885 Hs£382 ESTs, Highly simfiar to T00391 hypotheti 10 

106119 AL359624 Hs.11387 K1AA1453 protein 5 

106181 AI803651 HS.19160B ESTs 10 

106194 AW976171 Hs286194 typdhe&al proteh RJ22233 5 

106283 AW85846 Hs25522 WAA1808 protein 10 

20 106379 AL042069 Hs.1 19021 DKFZP434N061 protein 10 

106451 AW235928 Hs313182 ESTs 10 

106491 AA135688 Hs.10083 Homo sapiens, done {MAG&4139786, mRNA, 10 

106700 AA906434 Hs3776 zfocfciger proton 216 5 

106782 AW054886 Hs25682 Hcwiosaplens mRNA lor K1AA1 883 protein, 10 

25 105851 AW5B623 gtrfk04gQ9.x1 NCLCGAP Jii24 Homo sapiens 5 

106870 A1983730 H&26530 senjmo^^resporise(pho^haMyl 5 

1068% A1347578 Hs.124015 hypofte&al proteh MGC2605 5 

10S954 AF128847 H&20403B lnooie%tarrme r+^eftytiransferase 5 

106991 AJ223811 H&30127 bypofliefcal protein 5 

30 107103 AI446183 Hs3572 ESTs, Highly similar to CYA5JWMAN ADENY 5 

107124 AB006532 Hi31442 RecQproteMke4 10 

107148 AI005036 Hs334305 GS1999foB 10 

107214 AF127026 Hs5394 myosin IA 10 

107242 AB02D672 Hs.175411 WAA08S5 proteh 10 

35 107331 AB05985 Hs.111805 ESTs 10 

107351 U51704 Hs323428 ESTs, MatefatelysintotoAIJU8_HlJMANA 5 

107423 W26652 Hs£163 RBI hduced putaflw kinase 1 5 

107447 W28516 Hs.19210 hypoftefcaJ protein MGC11308 10 

107451 AL042425 H&283976 hypfte8cal protein PR02389 10 

40 107453 AI092790 H&334703 hypoftefcal proteh FLH4529 5 

107459 W38002 Empfrk^ selected nomAFFXstngtapr 10 

107683 N53167 Hs.47623 ESTs 10 

107711 W96141 HS220687 ESTs 10 

107754 AA017462 HsJ269244 ESTs 10 

45 107757 BE621721 Hs2B0792 hypofefcal proteh RJ12387sfai8arto 10 

107864 AA025060 H&61246 ESTs ' 10 

107872 BE271708 HsS5110 ESTs,WeakfyslmibrtoA5S43 l-phospha 10 

107888 AA025836 Hs.191637 ESTs 5 

107997 AL049176 Hs*2223 chordhfe 10 

50 108056 AAQ43675 Hs£2633 ESTs 10 

108081 AA093668 Hs28578 musdebfod posophBa)-a» 5 

108113 AA012881 Hs72531 hypofteBca! proteh FU11836 10 

103238 AA059473 H&66783 EST 10 

108257 AA6773Z7 Hs.144269 ESTs 5 

55 108335 AA07O5OO gb2ro70Mfts1 Stratagem TOuroepfthefium 5 

108351 AA071193 gbaf79b12^1 SauesjAteaLgteraUttHPG 10 

108382 NMJK)6770 H&67726 macrophage receptor w^coOagenousstr 5 

108392 AA075124 gton£6a01s1 Sftatageneowrfancanoer 10 

108441 AA079079 gbznti7c09.s1 Stratagene coton HT29 (937 10 

60 108446 AA085383 gbzn13gD3&1 Stratagene hffT neuron (937 10 

108497 AA074897 gbczm85a05.r1 Stratagene ovarian cancer 10 

108604 AA934569 Hs.49696 ESTs 5 

108662 AFt 17646 Hs.156637 Cas^^(nwn^)edro{&n^)vba)tr 5 

108706 AA121820 Hs.74569 KIAA0842 proteh 10 

65 108738 AA126583 H&158725 ESTs 10 

108827 At273692 Hs.1 10470 ESTs 10 

109123 A1028376 Hs.73232 ESTs 10 
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109389 AA101325 Hs.86154 hypdhefeal pro^n FU12457 10 

109546 F01449 Hs36954 Homo sapiens mRNA;cONADKFZp7626123(fr 5 

109919 R40604 Hs.129539 ESTs, WeaWy stafer to MCAT JflJMAH MITOC 10 

110006 AI094674 K&30524 Hng finger protein 24 10 

5 110141 H46749 Hs*31540 ESTs 10 

110354 W22165 Hs32586 ESTs 5 

110433 AW294182 Hs-301062 UDP4+*siykdp^ 10 

110448 H51276 Hs.13526 hypoff)^ca!pfdafr)FU12688 10 

110455 H52576 glxyt85eOaj1 Soare3jaffiaLstend_N3HPG 5 

10 110540 H72639 Hs.167608 ESTs 5 

110553 H60593 Hs.124990 ESTs 10 

110976 AL044174 Hs.159526 patched (Drosophlla) homotog 10 

110987 AI753316 Hs^6034 ESTs 5 

111158 N66616 Ks.136629 H^aptoinRNA^subteloniefte repeats 5 

IS 111168 AI798376 gfrfr34b07j(1 NCLCGAP_Ov23 Homo sapiens 10 

111187 AJ224864 Hs£6B8 leukocyte membrane anQgen 5 

111307 AA641636 Hs37477 ESTs, Weakly stmBar to T46908 hypdhefi 5 

111400 R00144 Hs.189771 ESTs 10 

111498 AI168511 gb:ow9Qh09.s1 Soares JetalJterjgjJeerL 10 

20 111651 R16733 Hs^0499 ESTs 10 

111738 R26065 gb:yh39oTfts1 Soares piacenta Nh2HP Homo 5 

111803 AA593731 Hjl325823 ESTs, Moo^atelyskri8artoAIii5JHUMANA 10 

111995 R42333 H&3Q2292 ESTs 10 

112071 AL117490 Hs47225 Ras^ssodated proteh Rap1 10 

25 112204 NMJ006668 Hs^5121 cytochnx»P450,sub^46(choJester 10 

112258 R51889 Hs24990 ESTs 5 

112490 R31094 Hs^4378 ESTs 10 

112588 R77302 gfcyi75hQ8.3l Soares placenta Nb2HP Homo 10 

112654 BE618629 Hs^68809 ESTs 5 

30 112784 T98628 Hs.191290 ESTs 5 

112817 AI057205 Hs.14584 ESTs 5 

112885 AA581428 Hs£021 EST 10 

112913 T16837 Hs.4241 ESTs 5 

113149 T51588 grxyb27d&s1 Stratagene fetal spleen (9 10 

35 113174 754659 Hs301755 Homo sapiens cDHAFLH 1465 fis.doneHE 5 

113203 AA743563 Hs.10305 ESTs 5 

113299 AW207424 Hs332594 ESTs 10 

113367 N92359 Hs.14518 ESTs, Moderatefrsfcrto to A48752 free! 10 

113457 R16763 Hs268€79 ESTs 5 

40 113563 AA913635 Hs326413 Homo sapiens cDKABJ20812fis > clone AD 10 

113574 R06874 Hs£68628 ESTs, Moderately simSar to ALU1_HUMAN A 5 

113776 AT791905 HsS5549 hypomefcal protein 10 

113790 A1244311 H&26912 ESTs 10 

113807 W07586 Hs*Q45 ESTs 3 

45 113958 W86195 gfczh54e05.s1 SoaresjBtaLPver_spteeru - 10 

114211 Z39319 HS27347 EST 10 

114254 AB018263 Hs.180338 fcOTorneaosls fector receptor superfarrd 5 

114349 AA745978 H&28273 ESTs 5 

114449 AA020738 gbczB63b11.s1 Soares retina N2b4HR Homo 5 

50 114484 AA034378 H&267319 endogenous retroviral protease 5 

114576 AA065096 p^znfiOatfcsl Stratagene fibroblast {937 5 

114624 AA081507 gtezn05biar1 SfralagenehNT neuron (937 5 

114844 AA234826 H&87386 EST 5 

114906 AA234462 Hs£7350 ESTs 5 

55 115624 AKD00725 Hs3Q579 hypc^fte^ protein FU2071 8 3 

115666 AF173081 Hs.178215 Vertebrate UN7 homotog 1, Tax interacfi 5 

115712 AB020649 HsJ4569 K1AA0642 pfoteto 5 

115889 AA398841 H&39850 hypoIhetoraotBlnaJ20517 10 

115949 A1478427 Hs.43125 esophageal cancer related gene 4 prolyl 10 

60 116107 AL133916 Hs.172572 hypc*ettealprotehaJ20093 10 

116180 AA463902 Hs.13522 ESTs,WeaWysinTaartoO8022hypotr^ 5 

116267 AW968703 H&30085 hypomefitdprotebiRJ23186 5 

116291 AW410377 H&415Q2 hypomeo^p(oteteRJ21276 5 

116527 AW1 94253 H&68607 ESTs 10 

65 116659 BE314852 Hs.168694 Homo sapiens clone 23763 intfmo*mmRNA.p 5 

116708 F10528 HsTOOOl ESTs, ModerateVsfmBar to JC6169 nude 5 

117056 AWB01806 gb:rl5^IM007O-11O4Ctt<)62-gO7 UM0070 Homo 5 
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117151 
117226 
117323 
117571 
117624 
117673 
117847 
117877 
117919 
118049 
118413 
118613 
118664 
118858 
118902 
119039 
119159 
119216 
119317 
119366 
119378 
119528 
119792 
119600 
119817 
119835 
119923 
119961 
120379 
120931 
121037 
121282 
121382 
121764 
122034 
122441 
122756 
122771 
123601 
123623 
123941 
124215 
124276 
124680 
125099 
125121 
125188 
125284 
125906 
128484 
128511 
128538 
128606 
128850 
128070 
128903 
128931 
129001 
129091 
129101 
129146 
129213 
129228 
129265 
129285 
129346 
129368 



AJ803656 

N20468 

A1472863 

N34417 

N26627 

N4Q551 

N49285 

AW263476 

BE222341 

N53145 

AW9556S6 

AXI76236 

N70907 

AL122040 

AA9935Z7 

A1160570 

AF142419 

AA514422 

AKD02001 

T77892 

T81824 

W38051 

AL049798 

AFD86332 

AF088061 

AF0B6429 

AW803308 

U34249 

AL042725 

AW136934 

AA907743 

AA401685 

AA405763 

AA421452 

AK000229 

AA447555 

AA458945 

AW135093 

AA609122 

AI024595 

AA621529 

H62570 

H83465 

AKD01527 

NNL014312 

T98199 

BE299567 

NNL0Q2666 

BE256206 

AA485421 

NNL0Q2250 

R44214 

C16161 

AA193106 

H39537 

AW50717 



Hs.42373 ESTs 



AA443323 

AA056483 

NMJW34Q3 

AL117472 

A1146494 

U40714 

AA530892 

BE617015 

AF110141 

NM.003877 



Hs.43387 

Hs.44584 

Hs42364 

Ks.184043 

H&1B2391 

Hs.44268 

H&279472 

H&90960 

Hs.49688 

H&230619 

HHQ2981 

H&293907 

Ha252097 

Hs.15020 

1&221849 

H&51305 

Hs.90949 

Hs£0552 

Hs58314 

Hs.159690 

Ha58429 

H&62954 

K&337461 

HsJ7162 

Hs.142373 

Hs*7334 

Hs.111939 

Ha.164851 

Hs38017 

HS&116 

Ks£5898 

H&97282 

H&112645 

H&97508 



Hs.163953 

Hs.112377 

Hs.48403 

HsJ71749 

Hs.103253 

Hs.17775 

H&270503 

Hs.10082 

Hs.101189 

Hs283040 

Hs.180817 

H&75309 

H&296176 

Hs.107242 

Hs.107812 

H&301463 

Hs.106665 

Hs.108924 

Hs.109525 

H&239307 

Hs.171695 

Hs.11006 

H&28890B 

Hs.110776 



gb.-yx39M0.s1 Soares melanocyte 2NbHM Ho 

ESTs 

ESTs 

ESTs, Weakly sfrnQar to JC4124 pregnancy 
Homo sapiens Ets-1 binding protein (E1B) 
ESTs 



ESTs 

gbcyv5509^1 Soares fetal Dver spleen 

ESTs 

ESTs 

EST 

Homo sapiens mRNA; cONA DKFZp434Gig72 (f 
hypofteifcal protein RJ23403 
pregnancy specffic beta-1-g!ycoprtrfeJn 6 
homotog of mouse quaking OKI (KH domafo 
ESTs 

wmaf musaibapoKurofic ffl>rosarcoma (a 
glxyd20fD4^1 Soares fetal fiver spleen 
EST 

EmpWcair/ selected from AFFX stogie pr 

dermatoporrBn 

ESTs 

ESTs 

ESTs 

terrain, teavypoJypepfide 1 

Human putative zinc finger protefri {ZNFB 

gbDKFZp434B1822j1 434 (synonym: htes3) 

ESTs 

ESTs 

ESTs 

Homo sapiens cDNA FU20470 fis, clone KA 
ESTs, WeaWy stmSar to KIAA0925 protein 
Homo sapiens cDNA FU2Q222 fls, clone CO 
EST 
ESTs 

ESTs, Highly stmSar to GIOOJiUMAH 110 K 
Homo sapiens mRNA; cONA DKFZp434D2472 (f 
a dbtotegrih and roetaBoproteJnasa doma 
gtea*47a023l Sc«r8S.tota|JetoJ<b2HF8. 
gbcyr44aOiJl Soares fetal Dwr spleen 
gbcysdIalUI Soares reSna N2b5r« Homo 
hypoliefical protein FU10665 
corffc at thymocyte receptor (X. bevis 
hypomefica) protein FU10847 
ESTs, Moderately state to AUHLHUMAN A 



p75OTfeassocsated eel deafii executor; o 
ESTs, Weakly stmtar to ALU7 JfUMAN ALU S 
potassium Intermediate^*** *?^ conductance 
ESTs 

hypc4heffca) protein PR02543 

chromosome 11 open reacflhg frame 23 

eiikdryo5ctranslaSonek)nga8onfoctor 

STAT fcduced STAT Hdftor 3 

Homo sapiens cONA RJ12965 fis, done NT 

BPOZ protein 

Human Chromosome 16 BAC clone CIT987SK-A 
zhedto 

SH^dornato protein 5 (ponsin) 

ESTs, Weakry simaar to (RX2JiUMAN tROQU 

tyrosyl-fflNA synthetase 

dual spedScfly phosphatase 1 

ESTs, Moderately similar to T17372 ptasm 

WAS protein ferity, member 2 

STAT induced STAT tnhIbftor-2 
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65 



128371 X06828 
129381 AW245805 

129440 W37944 

129441 BE061069 
129516 AFQ20038 
129554 BE222078 
129684 BB622468 
129702 AI304966 
129778 AKD01676 
13893 AKD00956 
13928 AB38993 
129973 AJ251760 
129977 NM.00Q399 
130014 NM.001158 
130085 M62402 
130089 AA452006 
130162 W80711 
130243 D88435 
13)315 AI241084 
130339 AM35746 
130400 V00517 
130436 NML001928 
130478 X72308 
130480 BE222978 
130494 AW390834 
130563 BE270472 
130589 AL1 10226 
130606 A1652143 
130634 AI769067 
130683 AA993269 
130689 NML006691 
130716 AA232075 
130718 AF2634G2 
130722 N41322 
130798 M81349 
130840 BE048821 
131184 AB040935 
131261 AA360419 
131282 X03350 
131328 AW939251 

131340 AKD00393 

131341 AF110908 
131406 H83294 
131489 BE394648 
131543 AW966881 
131692 BE559681 
131753 AA829286 
131756 AA443966 
131785 H69342 
131815 AAQ21258 
131819 BE244961 
131828 AJ00Q263 
131888 AW294659 
131927 AJ003112 
131949 AKD00010 
132115 H81604 
132177 X80818 
132296 AA467752 
132426 AW118072 
132477 S68874 
132675 AI291496 
132796 NM.006283 
132898 W28548 
132905 NM.004235 
132953 BE175645 
133116 BE563966 
133120 NM.003278 



Hs.110802 
Hs.110903 
Hs.4007 
Hs.301943 
Hs.11223 
Hs.113069 
Hs.11924 
Hs.12035 
Hs.12457 
Hs.13209 
Hs.134535 
H&273385 
Hs.1395 
Hs.143102 
Hs774313 
Hs333199 
Hs319946 
HS.1532Z7 
Hs.154353 

Hs^83108 

Hs.155597 

Hs351526 

Hs. 15760 

Hs.75874 

Hs^799G0 

Hs.16441 

Hs£88382 

Hs.127824 

Hs.17872 

Hs.17917 

Hs.18259 

Hs.18376 

Hs.18441 

Hs.1955 

H&20144 

Ks33954 

Hs.171776 

Hs.4 

H&25647 

H&25817 

t&297660 

H&284122 

H3L27414 

Hs.41639 

K&30736 

H&332053 

KL31595 

H&26320 

H&3Z753 

Ha.173103 

Hs278658 

H&34Q54 

H&34780 

Hs^58798 

Hs.178471 

Hs.178078 

Hs.195161 

Hs*9981 

Hs.170917 

HsM76 

Hs.173159 

HS224829 

Hs7934 

Hs321264 

H&6529 

Hs£5424 



vonWiflebrandfactor 

daudin 5 (transmemtoane protein deleted 



WAA0467 protein 

isocftrate dehydrogenase 1{NADP4sotu 
ESTs 

ESTs. WeaHy suite to 138022 hypotheB 
ESTs, WeaWy similar to 138022 hypothefl 
hypothetical protein RJ10814 
hypothetical {xotski FU10094 
ESTs 

guaruhe nucteoflde bfcotng protein (G pr 
eanVgraf^respc»Be2(ito-20(DrTO^ 
amine ootWase, copper containing 2 (refi 
frisuBiv^giowfo facta bMrigprote 
ESTs 

Homo sapiens mRNA for WAA1727 protein, 
cycSn G associated kinase 
nonseiedivesodhBnpotassajm^^^ 
gb2f79e03^1 Scares Jestis.NHr Homo sap 
hemoglobin, gamma G 
D cornponerrt crfcompterrient (aaTps&i) 
smal Wucfcte cytokine A7 {monocyte ch 
MYG1 protein 

pregnancy-associated plasma protein A 

HSPC015pioteiR 

DKF2P434H204 protein 

rn^efeaJ protein RJ13111 

ESTs, WeaWy stnrribr toT28770 hypo&ieti 

Homo sapiens, done 1MA6E3875012, mRNA 

extracafiuiar Snfc dornainKX)ntaining 1 

XPA binding protein 1; putative ATPfGTP) 

K1AA1319 protein 

ESTs 

serum amytoid A4 t constitutive 

smal fottodble cytokine subfemtyAfCy 

cerebral ceiacftesionrncdeane 

irositol(mYoMM 1 

alcohol dehydrogenase 1B (class I), beta 

v-fos FBJ murine osteosarcoma vnal onco 

BTB(F^ domain conta&iing 2 

TNF receptor-associated factor 3 

VWhHoitoiyfector-1 

hypothetical protein 

piogranvned eel death 2 

WAA0124 protein 

serum amyWdAI 

ESTs 

TRABK) protein 
ESTs 

FE65-UKE2 

keratin, hair, basic, 6 (mcdlethrix) 

Homo sapiens cDNA: FU22488 lis, clone H 

doubtecotesBssertcephafc^ 

hypoteScal prote&i FU2D003 

WAA0798 gene product 

gtutamate receptor, metabotropic 4 

ESTs 

dbcyigiycerol kinase, zeta (104kD) 
pros^toKfln E receptor 3 (subtype EP3) 
Homo sapiens, ckmeftttGE353M23.inRNA. 
trartsfonrong, adoTccolRfkog contain 
ESTs 

KruppeWke factor 4 (gut) 
U3P protein 32 

ESTs, WeaHy similar to 178885 sabw/m 
tetranectin (ptasmtoooerttinalr^ 

104 
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5 
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10 
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10 

10 
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133139 
133163 
133268 
133272 
133379 
133407 
133552 
133702 
133719 
133731 
133789 
134007 
134055 
134111 
134117 
134177 
134308 
134361 
134369 
134449 
134467 
134496 
134510 
134550 
134577 
134591 
134678 
134728 
134758 
134786 
134912 
134963 
134969 
135001 
135066 
135173 
135197 
135219 
135250 
135304 
135337 
135417 
101367 
128870 
129381 
130085 
130689 
133120 
133407 
133731 
134369 
135086 
135173 
322580 
408790 
418043 
427458 
446674 



AF052138 Hs.6580 

AA668224 Hs.6634 

AW956781 K&2S3937 

NMJH2776 Hs.69423 
AA207059 



AF017987 

H21497 

L02321 

H26904 

N71725 



AP072441 

D86062 

AQ72588 

AA081846 

BE243319 



Hs.7306 

Hs.7471 

H&75652 

HsJ5736 

H&272572 

Hs.76239 

Hs.7840 

Hs.182423 

Hs£022 

Hs.7921 

Hs.79672 



AW905827 Ha81454 

BE549343 H&82208 

AF207664 Hs*230 

L34155 Hs*3450 

AJ190413 Ha8373 
M64936 

NMJ002757 H&25087O 

M26315 Hs£5258 



BE244323 
U73394 
AL008583 
D10216 



Hs£5951 
Hs.166085 
Hs.182595 
H&89394 



NM.000078 Hs.89538 

T29618 Hs*9640 

187521 Hst261457 

NM.003394 H&91985 

H22570 Hs.172572 



AA3Q2517 

XD4430 

AUJ36557 

U76456 

AB002361 

U83171 



H&9Z732 

H&93913 

Ha95910 

Hs.190787 

K&96633 

K&97203 



AM16829 Hs.191597 
AA905408 H&99Q5 



X55019 
X033SO 
H39537 



H&99975 
Hs.4 

H&75309 



AW245605 Hs.110903 

M62402 H&274313 

NM.006691 Hs.17917 

NM.003278 Hi65424 

AP017387 Hs.7306 



N7172S 

AF207664 

XD4430 

AL036557 

AKD01852 



Hs.272572 

H&B230 

H&93913 

Hsj95910 

Hs374151 



Homo sapiens cDNA: FU23227 fis. done C 

Homo sapiens cDNA: FU22547 fis, clone H 

ESTs, WeaWy simitar to FXD2JiUMAN FORKH 

ka&reinlO (KLK10)(PRSSL1)(nes1) 

gb2q80h09.s1 Stratagene hNT neuron (937 

secreted frizzJed-related protein 1 

BBP-tite protein 1 

glutathione ^transferase M5 

apafjpopiotBfti D 

hemogtobh, alpha 2 

hypothetical piolnii FU20608 

calc&neurin btndiig protein 1 

ESI (zebiafish) protein, human homotog o 

TU3Aproteh 

Homo sapiens mRNA; cDNA DKFZp566E183 (fr 
KlAA0652o^ product 
tetohcjtoHha^e (buctokinase) 
acyL&enzyme A dehydrogenase 
a disaitegrin-lte and metaBoprotease ( 
bmWn, alpha 3 (nfceto (150kD), fetal 
ESTs 

gfcHomo sapiens reflnofc add-frtducfirfe 
mtDQen-acSvated protein Idrase kinase 
CD8 antigen, afcha polypeptide (p32) 
exportin, tRNA (nuclear export receptor 
HDercefl lmmunogtobuftfrlke receptor 
dyneh, axonemat Dght polypeptide 4 
POU domain, class 1, transcription facto 
cholesteryt ester transfer proteh, ptas 
TEK tyrosine kinase, eratomefal (venous 
ESTs 

wingtess-type MMTV integration sSe faird 

hypothetical proteto FU20093 

K1AA1444 protein 

tnterteukin 6 (hterferon, beta 2) 

puta^rymphocyte(^1swtoo^fte 

fissue imBAor of metaSoprofesnase 4 

WAA0363 protein 

small fnducfbte cytokine stittarnflyAfCy 
ESTs 

ESTs, WeaWy simflaf to unnamed protein 
cfto&nergic receptor, nicotinic, delta p 
alcohol deliyifcuyenase 1B (class I), beta 
eukaryofc UansfaBun etongatton factor 
ctaucun d (oansroemDrane protein oesso 
EnsuSn-Sce growth factor btodtng prote 
extraceflular Ink domatrKarrtairdng 1 
tetranectin (plasnAwgen-binding protein 
secretes nsaecHeiated protani 
hemogbUn, alpha 2 
a disintegrfn-Qke and rnetalloprotease ( 
tnterteukin 6 (interferon, beta 2) - 
pufafive rymphocyte G0K31 sw&ch gene 



AW560227 Hs.47860 
AW377752 HsJ3341 



BE208364 
AA563892 
U85642 
RCLH15814js 
Ya024w/RIP1 



H&29283 
H&3Q6000 
Hs.138506 
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5 
10 
10 
5 
5 
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3 
10 
5 
5 
5 
5 
10 
3 
5 
5 
5 
10 
5 
5 

neuftfHififttefyrDstekEna 10 
AXL receptor lyrosireldnase 5 
ESTs, WeaMy sknSar to LKHU proteoglycan Bnk 5 
solute earner faraBy 4 (artton exchanger), memo 10 
ESTs 5 
HurrraapM1rr#ttiAforGS3109(!K>^ 
EST-YR024WRIP1 3 
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TABLE 1 A 

Table 1 A shows the accession numbers for those pkeys lacking unigenelD's for Table 1 . 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Ptey: Unique Bos probeset tdenffer number 

CAT number Gene duster number 
Accession: Genbank accession numbeis 



Pfcey CAT Number Accessions 

108446 112224 1 AA085383 AA126091 AA074174 AA075373 AA07912O AA070831 AA075978 AAD75372 AA128503 
108497 110079 2 AA074837 AA113914 AAD64871 AA079329 AA071309 AAD84710 M129030 AA075O42 AA074794 AA071453 AA078803 
AA148628 AA122204 AA074159 AA126185 AA0791 17 AA127Q89 AA070912 AA079280 AA131372 AA078833 AA071O87 
AA076131 AA071047AA079401 AA083070 AA102O76 M1 15163 AA074198 AA134725 AA1 13889 AA121 103 AAD75041 
AA065148 AA071310 AA101 144 AA079659 AA078S31 AAO79209 AA07O928 AA068994 AA069B17 AA078187 AA069053 
M131489AA071308AA063317AA070158AA071430AAD76055AA07^ 
AAO792O8AAO74583AAO71O86AA079623AAO7O627AAD788Q2AAO766# 

AA148748 AAO79230 AA0B5188 AA074485 AA07O580 AA076151 AAD83166 AA0851 18 AA079450 AAC85044 AA120938 
AAO792D0 AA1 00188 AA081472 AA122355 AA129031 AA085362 AA069220 AA070940 AA075968 AA074563 AA084027 
AA115929 

124215 1597154 J H62S70 169063 

117058 1219924 J AW801806 H90434 BE086530 

110455 46874 1 H52576AF085971 H52172 

111168 38585 1 AI798376 S46400 AW811617 AWB11616 W00557 BE142245 AW858232 AW861851 AW858362AA232351 AA218567 
AA055556 AW858231 AW857541 AW814172 H66214 AW814398 AF134164 AA243093 AA173345 AA189942 AA223384 
AA227092 AA227080 T12379 AA092174 TB1 139 AA149776 AA699829 AW879188 AW81 3567 AW81 3538 AI267168 
AA157718 AA157719 AA100472 AA100774 AA130756 AA157705 AA157730 AA157715 AA053524 AW849581 AW854566 
C05254 AW882836 T92637 AW812621 AA20G583 AA209204 BE156909 AA226824 AI829309 AW991957 K66951 AA5Z737 4 
K6G21 5 AA045564 AK94265 H60808 AA149726 AW195620 BE081333 BE073424 AW817662 AW817705 AW817703 
AW817659 BB081531 H58570 

111498 411008 1 AI168511 AW22712AA700366R07371 R07324 

104340 46289 10 AM2B1B9 F15201 

103747 117944 1 AA081995 AA101099 

134496 46501.1 M64936 A1Q25512 AI382987 BB061777 AA089966 BE1 69930 T41 176 AW594624 BE50241 5 AA121893 A1269283 T4031 1 

AW79277 A241318 BE327710 AW975215 AW898268 AA884990 BE327514 

103750 118365 1 AA126129 AA126033 AA082561 

105239 34624 T AA221036R87170BE537068BE544757C18935AW812058TB2565AA2Z7415AA 

AW868639 BB061833 BE00062Q AW961170 AW847519 AA308542 AW821833 AW945688 C04699 AA205504 AA377241 
AW821667 AA055720 AW817981 AW856458 AA155719 AA179928 T03007 AW754298 AA227407 AA113928 AA307904 

C1S859 

AL042725 BE063316 AW975610 AA457591 BE062092 AI6552Q2 AA714296 AI267264 AJ075321 AA223286AA071122 
AA227849 AA216700 AB96002 AA101B67 AA099426 AA135997 AL041698 T02815 T51824 AA207189 T59230 T51868 
AA663341 BE165757 AW818104 AW392886 AA584918 AA0994C8 AW856396 AW881859 AA053045 
114624 111686 1 AA081507 AA07DO71 AA070840AA084362 
106851 322947 J AI458623AA639706AA4854Q9R22065AA48557D 
108392 113549 1 AAO75124AA0752O8 
100545 229S 11 M55405AW752552 

100654 BgrJfT2969 A03758 A06977 A15293 D17029 D17107 D17171 L00132 100133 Ml 2523 M13075 M13076 h^2S16 U22951 V00494 V00495 

X513S3 X51364 X51365 
100702 tior HT3413 L27065 

102208 6735_9 U22961 AA203623 AA5Q3337 AI174733 AI192802 C06092 AA035357 AI190619 A1199244 A1828450 AA602298 AG78195 
M209170 A1186653 AI127795 AI183846 H77389 A1589465 AA629390 H94306 AI018388 R68584 AA027196 AI745413 
A1685092 AI093426 AI623673 AI074570 N50096 AA047486 N25060 AA327614 AW42512 A1383957 AA156873 AB33101 
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AB84569AA257011 
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N70806 A1141254 A1383191 AM01237 AI080709 AKB3400 W84549 T90806 RD0012 W01413 AA630557 AB78348 A1559265 
AA877103 W84464 AA62S146 R68379 AI1332D7 AI132980 AJ133214 AK64826 A1061615 AI133473 AI174852 A1133404 
AI133272 V00494 M12523 M12523 AI207526 A1133120 AI064802 AI174993 AI1 14729 AKJ51645 A10S4716 AI064959 H77388 
T85706 AF075298 AM 10799 D17107 NM_000477 AF190168 R50724 AJ248416 AE07432 AI133684 AI133345 AI174710 
AM33290AM33304AI174948AI207484Att10717AF074624A!114515 

A1207568 AH64960 A!174753 All 14666 R69184 R00011 AI064997 T60501 A1207701 T71735 AA385318 H73569 T60496 
H94399 A1133158 T74675 AA484750 773413 T55909 R50261 T72061 N80533 751189 T74936 AI207490AI1 32925 AJ064701 
AJ174748 AH 14663 A1133104 AI132999 AH33100 ATO64925 M064979 AI133063 AA343347 769091 AA233989 T39772 
M444620 752290^6931740012748403 758926 7691 95 AI133061 750850 

N56507 787485 AM 33622 AA343370 740075 76967 1 T53849 774820 AFC75316 AI110818 T40121 T57381 A1 114468 
AA332728 751362 AM14589 R06691 AH 10629 AF063503 AI140543 AA334661 AA332720AA343262T73513 7B6549 
All 14840 757284 T39981 T61407 T72757 T51749 T56630 AA343125 T72126 R94135 T83Q28 739972 739896 AM74786 
AI132926 R09237 AH)64838 AM33660 760398 T88753 755930 792126 AJ444602 760996 AI1 14732 H9391 1 AI133106 
R10779 AI065020 790925 750889 D17Q29 A1133703 AA333805 A1133040 AH33017 AI064857 AI1 10730 AF074637 AI207567 
H71080 773217 AA343950 AI174743 AA334224 AA334281 R06692 764739 7401 63 760628 781661 773179 R01842 
AA501730 739931 739662 T40136 AA334904 771425 K77784 R0C874 AI065049 784512 755918 ACQ7595 739951 
AA005016 760361 769176 773356 758795 761233 739955 760612 AI114676 AI064778 AA035710 W52763 AI114786 
783564 AA341859 781684 755769 AI114710 751776 AA343213 AI114714 758102 AI1 10809 R28984 AI174854 AA305675 
AA343592 753836 746869 764721 755508 W05241 754019 157945 76051 3 T48364 AF0753O8 W88731 782851 T48269 
H54053 77321 1 AM 14590 748317 755S65 774857 R84226 756552 752231 774946 776976 K02576 795666 AEQ3974 
AM89471 AA005147 AI4781Q2 AJ207662 AI192792 A/768421 AB64737 AW051713 AA936693 AM331 17 Aff6fi232 AS13646 
783962 A1065112 A1207689 AM74684 AB07702 781475 AI133325 AIQ32512 AA701169 A1936354 AM14720 A1433289 
AA046980 A1823482 AM 14536 AA860651 AW242644 R07469 AW30O438 AI13341 6 AW271 670 A1991363 778943 AI823481 
AAB45518 AA719124 AA883454 768850 T691 15 AI935509 AI150977 762890 T71374 768294 AI174774 76741 1 T68318 
AI064689 756624 769010 758982 768302 AB32829 772908 AI064819AB05880 762895769430795111 AA025050 773330 
W52657 771984 769118 W92684 A1114860 762093 761797 AB22333 773322 K92981 756018 T61811 757232 AI336158 
761821 769457 762900762912 772917 746885 AI702448 757212 757203 R94581 771311 761819785358 757708 770918 
T59166 AI1871 1 1 T64308 762071 769427 AI114750 760430 RD9734 760)33 T69141 T69453 767908 R16809 769394 
AK07729 755839T90273T73339AW194909775486 771850 771305771287 753877 773452 T68852 N75290AI312890 
767751 AM74983T51679T54851 H69880N73734 AA443453 773466 H69672N53869 768447 D1 1809 D12412T64300 
728321755884 



123941 


genbank J AA621529 


AA621529 


118049 


genbankJ453145 


N53145 


102800 


1478^20 AA313538U88895 U88902 


104106 


AA422123jLaiAA422123J 


111738 


gcnbanK_R26065 


R26065 


113149 


gehbanKJ51588 


751588 


113958 


genbankJW86195 


W86195 


108335 


genbanK^AA070500 


AAO7O50O 


108351 


genbanlLAA0F71193 


AA071193 


108441 


genbanK_AA079079 


AA079O79 


124276 


gentenKJB3465 


H83465 


101447 


ente;_M21305 


M213Q5 


117226 


genbanKJC0468 


N20468 


133379 


genban)LAA207059 


AA207059.AA207241 


119366 


genbanKJ77892 


777892 


119528 


NOT _FOUND^entre?_W38051 W38051 


112588 


genbanXJ*77302 


R77302 


114449 


genban)LAA020736 


AA020736 


114576 


genbanK_AA065096 


AA065096 


107459 


W38002JUrt W38002JB 


130339 


genbanRJ\A435746 


AA435746 
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TABLE 2: Figure 2 from BRCA 001 US 
5 Table 2 shows genes downregulated in tumor tissue compared to normal breast tissue. 



Ptey: Unique Bos probesei identifier number 

ExAocn: Exemplar Accession number, Genbank accession number 

10 UnfeenelD: Unlgene number 

UnfceneTHJe: Unlgene gene file 

R1: Ratio of normal breast tissue to tumor 



Pkey 


ExAccn 


UrugeneH) 


Unlgene Tfttie i 


R1 


100499 


T51986 


H&283108 




10 


100549 


BE142Q19 


Hs£22056 


Homo sapiens cONA FLI1 1572 fis, done HE 


10 




A03758 




NM fWfU77*Hnmn amfang athirmin fAI ffl m 


10 


100971 


BP379727 


Hs.83213 


fatty add bfrxWng proteh 4, ad^ocyte 


10 


101184 


NM 001674 


Hs.460 


activating tnmscnpBon factor 3 


10 


101336 


NM 006739 


H«TCR7ft 

|t0»F wf V 


PRJ minnA osfnnsfHmnvi vitsi tmcnosnfl n 

1 IIMllv VHWWlllMHlal rucu winA^yvflv ll 


10 


lUIOOf 




Hs.4 


almhnl rtphvrinxiecssfi 1B f class ft. hda 


10 


101447 


M21305 




gfcHiffnan alpha satpffite and satefifte 3 


10 


101461 


N98569 


HsJ6422 


ptosphcfyase A2, group IIA (platelets, 


10 


101511 


MZ7626 


H&267319 


endogenous fetrovtal protease 


10 


101736 


M74447 


Hs£02 


transporter 2!, ATP-bhcDng cassette, sub-fam2 


10 


102208 


U22961 




gteHunffn mRNA done wttfr stmBarfly to L 


10 


102450 


U48251 


Hs.75871 


proteki kinase C bfxfing protein 1 


10 


102800 


AA313538 




gb£ST185419 Cohm carcinoma (HCC) cefi 


10 


1Q2857 


NMJD06744 


Hs.76461 


letinolttuung protein 4, inttsstflia) 


10 


102990 


AA829286 


Hs332053 


serum amyloid A1 


10 


103747 


AA081995 




gbczn26d06 j1 S&atagene neuroepHhefium 


10 


103812 


AA137107 


HS&6391 


Homo sapiens, done MG&16638, mRNA, com 


10 


104093 


R50727 


H&336970 


ESTs 


10 


104109 


AL353957 


HsJZ84181 


hypdh^protoDKFZp434P0531 


10 


104250 


F06638 


Hs.12440 


Homo sapiens done 24734 mRNA sequence 


10 


104492 


N73185 


HSS4285 


EST 


10 


104506 


N91071 


Hs.109650 


ESTs 


10 


104532 


AI498763 


HSJ203013 


hypomeScal protein FLM2748 


10 


104677 


AA0Q9764 


H3.190380 


ESTs 


10 


104711 


AA017245 


H&32794 


ESTs 


10 


104731 


AA019300 


Ks.125070 


ESTs, Moderately stater to B4374 gene 


10 


105005 


AI298208 


Hs£8805 


ESTs 


10 


105036 


AA130390 


Hs25549 


hypoOtefical protein FU20898 


10 


105239 


AA221Q36 




gbczi03r12j1 StratageneOT2 neuronal pr 


10 


106052 


N79885 


Hs.6382 


ESTs, Highly simBarto T00391 hypomeB 


10 


106181 


AI603651 


Hs.191608 


ESTs 


10 


106283 


AI085846 


Hs^5522 


KtAA1808pmteln 


10 


106379 


AUM2Q69 


Hs.119021 


DKFZP434N061protefci 


10 


106451 


AW235928 


HsJ313182 


ESTs 


10 


106491 


AA135688 


Hs.10083 


Homo sapiens, done IMAG&41397B6, mRNA. 


10 


106782 


AWQ54886 


Hs^5682 


Homo sapiens mRNA for HAA1863 protein, 


10 


107124 


AB006532 


HsJ31442 


RecQprotein-fikB4 


10 


107148 


AI005036 


HSJ34305 


GS1999tul 


10 


107214 


AF127026 


rfe5394 


myosin IA 


10 


107242 


AB020672 


Hs.175411 


KlAA08&5pmtefiFi 


10 


107331 


AI905985 


Hs.111805 


ESTs 


10 


107447 


W28516 


Ks.19210 


hypotheflcal pariah MGC11308 


10 


107451 


AU042425 


t&283976 


hypmetel protefe PRQ2389 


10 


107872 


BE271708 


Hsw95110 


ESTs,WeaWysbTinartoA559431i)hospha 


10 


108351 


AA071193 




gbzf79M2£l Soaies.pJneal^landJCHPG 


10 


109546 


F01449 


Hs3S854 


Homo sapiens mRNA; cONA DKFZp7626123 (ff 


10 


110433 


AW294162 


Hs301Q62 


UDP4+*£tyl*Ir^^ 


10 


110976 


AL044174 


Hs.159526 


patched (DrosophQa) hcmotog 


5 
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111168 AI738376 gh*34bfl7jc1 NCLC6AP_Ov23 Homo sapiens 10 

1H651 R16733 Hs.20499 ESTs 10 

111803 AA593731 Hs^25823 ESTs, Moderately sinter to ALU5jflJMAN A 10 

114484 AAQ34378 HsJ67319 endogenous retroviral protease 10 

5 125284 NMJ0Q2666 Hs.103253 pedDp^i 10 

128850 AA193106 Hs.180817 chrornosome 11 open reading name 23 5 

128903 AW150717 H&296176 STAT induced STAT hhtotof 3 10 

129346 AF1 10141 HsJ288908 WAS proten family, member2 10 

129381 AW245805 Ha.110903 daxfin 5 (trarcme^ 10 

10 129516 AFO20038 Hs.11223 feoritrate dehydrogenase 1 (NADFH soJu 10 

13554 6E222078 Hs.113069 ESTs 10 

130085 M62402 Hs^74313 hsufovtep^owfo factor bh^ 10 

130243 DB8435 Hs.153227 cycffinG associated tdnase 10 

130400 V00517 Hs^83108 hemogbbh, gamma 6 10 

15 130436 NM.001928 Hs.155597 DcoiTpcnentofcojnp^ 10 

130563 BE270472 H&279900 HSPC015 protein 10 

130589 All 10226 Hs.16441 DKFZP434H204 proteh 10 

130683 AA993269 Hs.17572 Homo sapiens, done IMAGE3875012, mRNA 10 

130689 NM.006691 Hs.17917 extracellular fink oWiirKarrtaWng 1 10 

20 130689 AA046747 Hs.17917 extraceQu)ar^oWi!rHxmyning 1 10 

130718 N70196 Hs.18376 WAA1319 protein 10 

130798 MB1349 Hs.1955 seromarm^ldM.ccralhiithre 10 

130840 BE048821 Hs^0144 sroaflirctudbiecytokte 10 

131164 AB040935 Hs3»54 cerebialcefl adhesion molecule 10 

25 131282 XD3350 Ha.4 aJcahd ctehydrogenase 1B (class I), beta 10 

131328 AW939251 Hs^5647 v4reFBJnwnne()steosaixara viral ora 10 

131543 AW966881 Hs.41639 pfl^ammedceldeafii2 10 

131753 AA829286 H&332053 serum arnytoidAI 10 

131785 H69342 Ha^6320 THABD protein 10 

30 131828 AJ000263 Hs378G58 kerafin. hair, basic. 6 (monBeMx) 10 

132426 AW118072 Hs*89981 cfBcylglycerol kinase, zeta (104kD) 10 

132675 A1291496 H&5476 Homo sapiens, done IMAGE3530123. mRNA, 10 

132898 W28548 r&224829 ESTs 10 

132905 KMJ504235 Hs7934 KruppeUke{actor4^uQ 10 

35 133120 NNL003278 Hi65424 teiranecSn (#a»nhc^ 10 

133407 AF017987 Hs.7306 secreted frfzztetkeiated proteh 1 10 

133719 H26904 HsJ5736 apoipoprotein D 10 

134007 AF072441 Ks.7840 catemeurh binding proteh 1 10 

134055 D86062 Hs.182423 ESI (zebrafish) proteh, human homotog o 10 

40 134111 AI372588 H&8022 TU3A proteh 5 

134117 AA081846 Hs.7921 Hanosaptomf^cDNAOKFZp566E183^ 5 

134177 BE243319 Hs.79672 KlAA0652gerte product 10 

134369 AF207664 Hs^230 ac&nlegn^^andmeta^^ 10 

134496 M64936 c^kirosaptere^ackMnc^^ 10 

45 134510 NNL002757 Hs^50870 rrdop^adK^prclehkhasela^ 10 

134550 M26315 H&6525B C08 andgen, alpha polypepBcto (p32) 5 

134758 NKL000078 Hsl89538 chdesteryl ester transfer proteh, phs 5 

134963 NMJXB394 H&91985 whoJess^h^hteQ^ste&ni 10 

135066 X04430 HsS3913 mterleukh 6 frierferon, beta 2) 10 

50 408790 AW580227 Hs.47860 neurrticiihicfyrcdhel^^ 10 

446674 AA563892 Hs.306000 solute carrier fanty4(artoexcrKmger),rremb 10 
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TABLE 2 A 
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Table 2A shows the accession numbers for those pkeys lacking unigenelD's for Table 2. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession'* column. 

Pfcey: Unique Eos probesst bfsnfifter number 

CAT number Gene duster number 
Aooessbn: Genbank secession numbers 



Pttey 



CAT number Accessions 



111168 38585.1 



1(0747 
134498 



117344.1 
4S501J 



105239 34624 1 



100654 



102208 6735_9 



AI798376 S464C0 AW811617 AW81 1616 WC0557 BE142245 AW858232 AW861851 AW858362 AA232351 AA218557 
AA055556 AW858231 AWB57541 AW814172 H6S214 AW814398 AF134164 AA243093 M173345 AA199942 AA223384 
AA227092 AA227080 T12379 AA092174 761139 AA149776 AA699829 AW879188 AW813567 AW813538 AE67168 AA157718 * 
AA157719 AA1C0472 AA1C0774 AA13075S AA157705 AA157730 AA157715 AA053524 AW849581 AW854566 C05254 
A W8B2836 792637 AW812621 AA20S583 AA209204 BE156909 AA226824 AB293Q9 AW991957 N66951 AA527374 H6&215 
AA0455S4 A1694255 H60808 AA149728 AW195620 BE081333 BE073424 AW817662 AW817705 AW817703 AW817659 
BHJ81531 H5S570 
AA081S95AA101099 

MS4936 AJQ25512 A13B2387 BE081777 AA089966 BE169930 T41 176 AW594624 BK02415 AA121893 AI269283 T4031 1 
A1684569 AA257011 AI079277 AB41318 BE3Z7710AW975215 AW8S8268 AA884930 BE327514 
AA221G36 R87170 BE537088 BE544757 C18935 AW812058 T92565 AA2Z7415 AA233S42 AA223237 AA668403 AA601627 
AW869639 BEQS1 833 BEQ00S20 AW9S1 170 AW847519 AA308542 AW821833 AW945688 C04699 AA205504 AA377241 
AW821667 AAD55720 AW817981 AW85646B AA155719 AA179928 703007 AW754298 AA227407 AA1 13928 AA307S04 
- C16859 

%r_Hra69 A03758 A06977 A15»3 D17029 D17107 D17171 LD0132 L00133 M12523 M13075 M13076 &&2816 UZ861 VD0494 VD0495 
X513S3 X51364 X5138> 

U22981 AA203623 AA503337 AM74733 AM92802 CCS092 AA035357 AI190619 AI199244 AB28450 AA6022SS AI378195 
AJ209170 AI18S653 A1127795 AH 83846 H77389 A1589465 AA6293S0 H943QS AKH8388 R8J5&4 AA027196 AS745413 
AK85092 AJ09342S A1623873 AJ074570 N5009S AA047486 N25060 AA327614 AI042512 A1383957 AA156873 AI333101 
M70806 AJ141^4 AI383191 AW01237 AI080709 AKKB4C0 W84549 T908QS R00012 W01413 AA630557 AI378348 AI559285 
AAS77103 W84464 AA625146 R68379 AJ133207 AJ132980 AM 33214 AK64826 AJ061615 AH33473 AI174852 AI133404 
AJ133272 V004&4 M12523 M12523 AI2D752& AI133120 AI0648Q2 AI174S93 AH 14729 Ai0S1645 AJ084716 AICS4959 H77388 
7B570S AF075S8 AI1107S9 D17107 mUMMM AF180168 R50724 AB48416 AI207432 AI133684 AM33345 AH74710 
AH33ZK) AI133304 AI174948 AE07484 All 10717 AF074624 AI114515 AF0S3516 A111CS42 AH 14559 AH 14498 AI114759 
AI2B7568 A06496O AI174753 AI114686 R69184 R00011 AI084997 780591 A&07701 T71735 AA385318 H735697B0495 
H94399 AI133158 T74675 AA484750 T73413 755S09 R50261 T720S1 N80533 T51 189 T74936 AB074SO AM3ZJB AKJS4701 
AI174748 AI1 14663 AI133104 AI132333 AH331C0 AEQS4925 AE0S4979 AH33QS3 AA343347 T6SQ91 AA233989 T39772 
A144462D752280D16931 740012 74840375892S 769195 AH33Q81 7^ 

W55507 T87485 AM33S22 AA343370 T4C075 76SS71 753849 774820 AF075316 AI1 10818 T40121 T57381 AI114468 
AA332728 751382 AH 14569 R0SS91 AM 10S2) AFCS3503 AI140543 AA3348>1 AA33273) AA343£>2 773513 7B6549 
AM 14840 T57284 739981 T61407 T7Z757T51749 756630 AA343125 772128 R94135 T83028 T39972 T3989S AM74786 
AI13292SR09237Al0$4838AI13366O 760398 788753 755930 792126 ^ 

R10779 AI085Q20 790925 T50889 D17Q29 AI1337Q3 AA3338G5 AI133040 AI133017 AKJS4857 AI110730 AF074637 AI207567 
H71080 T73217 AA343950 AM74743 AA334224 AA334281 RCS892 T64739 T40163 76CS28 7B1681 7731 79 R01842 
AA501730 739931 739862 T4CM35AA334S04 771425 H77mR00874AI0^ 

AA005016 160361 769176 T7335S T58795 T61233 T39955 TKJS12 AI114676 AI0S4778 AA035710 W5Z763 AI114786 
T83S4 AA341859 T81©4 T55769 AI1 14710 T51776 AA343213 AI114714 758102 AH 10809 R28984 AI1748W AA3^675 
AA343592 753836 T46869 764721 755508 W05241 754019 757345 760513 748364 AF075308 W88731 782851 T48269 
^4053 773211 AM 14550 748317 T55865 774857 R84228 756552 T52231 774346 776976 R02576 795668 AG&3974 
AI183471 AAC05147 AI4781 02 AI207662 AJ19Z792 AT7^421 AI064737 AW051713 AA93®93 AI1331 17 AI7^232 AI913646 
T839S2 AIC651 12 AI207669 AI174B4 AI207702 T81475 AI133325 AIQ3^12 AA701169 AI936354 AI114720 AI433289 
AA046980 AI823482 AI114536 AA^0651 AW242644 R074G9 AW3C0438 AJ133416 AWZ71670 AI391363 778343 AI823481 
AA845518 AA719124 AA863454 76B850 769115 AI935509 AMH»77 762890 771374 1^4 AI174774 767411 TK318 
AW54689 7K624 769010 768982 768302AI332aa 77S08 AKI64819AI205B80 71628^ 7694^)^51 11 AAQ25050 773330 
W^57 T71984 759118 W92684 AI1148Q) 762(03 761797 AI522333 T73322 H92981 75^)18 76181 1 T37232 AG36158 
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761821 T69457 T62900 762912 T72917 T46885 AI70244B T57212 T57203 R94581 T71311 T61819 T89358 T67708 T70918 
759166 Al 1871 1 1 7B4308 T62071 T69427 All 14750 TB0430 R09734 T69033 T69141 T69453 T67908 R16809 T63394 
AI207729 755839 T90273 T73339 AW194909 775486 T71B50 771 305T71287 753877 773452 768852 N75290AI312890 
767751 AI174983 751679 754851 H69880 N73734 AA443453 773466 H69872 N53869 TS8447 D1 1809 D12412 764300 
728321 755864 

102800 14782J0 AA313538 U88895 U889Q2 

108351 genbanU*A071193 AA071193 

101447 entrazJA21305 M21305 
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TABLE 3: Figure 3 from BRCA 001 US 
Table 3 shows genes downregulated in tumor tissue compared to normal breast tissue. 



Pkey: 

ExAccrc ExBmplar Accession ramt)^ 

10 UnfcenelD: Un^ene number 

Unjgene Title: Unfeene gene title 

R1: Ralfo of nomial breast Bssue to tumor 



15 



20 



25 



Pkey ExAccn UnJgemrfD UnJgenerrtte 



101336 NM.006732 

102208 U229S1 

102990 AAB29288 

111168 AI798376 

111803 AA593731 

130085 M52402 

130840 BE048821 

131543 AW966881 

13312D NM_003278 

134758 NMJH0078 



Hs,75678 

Hs332053 

H&325823 

H&274313 

Hs^0144 

Hs.41639 

Hs.65424 

H&89538 



R1 

FBJ murine osteosarcoma viral oncogene h 10J) 

gb^unranrfmAckTOwHhs&idafifytoL 10.0 

serum amytoHAI 10U0 

gWi34b07jc1 NCLCGAP_Ov23 Homo sapiens 10.0 

ESTs, Moderately dmBar to ALU5JiUMAN A 10JD 

msuftvJtogrcwth factor binding prote IOjO 

^KhK*fecfr^sub^A{Cy 10i> 

prog rainmed cefl death 2 10D 

tetranectin {plasnTtocjo^^ 10JD 

cholester/lestertrar^prDtelrupbs 10.0 
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TABLE 3 A 

Table 3A shows the accession numbers for those pkeys lacking unigeneED's for Table 3. For 
each probeset, we have listed the gene cluster number from which the oligonucleotides were 
designed Gene clusters were compiled using sequences derived from Genbank ESTs and 
ioRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (BoubleTwist, Oakland California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" column. 

Pfcey. Unique Eos probesei Identifier number 

GATnumbsr: Gene duster mnnbsr 

Genbank accession numbers 



Ptey CAT number Accessions 



15 

111168 38585J AI798376 S4B4O0 AW811617 AW811616 W00557 BE142245 AW358232 AWB81851 AW858352 AA232351 AA218567 

AA055555 AWB58231 AW857541 AWB14172H66214 AW814398AF134164 AA243Q93 AA173345 AA199942 AA223384 
AA227092AA227(^T12379AAQ92174T6N 
20 AA157719 AA100472 AA1C0774 AA130756 AA1577D5 AA157730 AA157715 AA053524 AW849581 AW854566 C05254 

AW8B2835 T92637 AW812621 AA2DS583 AA2092Q4 BE158809 AA226824 A1823309 AW391957 NS8951 AA5Z7374 KGG215 
AA045564 AI694265 H608Q8 AA149726 AW195620 BED81333 BHJ73424 AW8176S2 AW817705 AW817703 AW817659 
BE081531 H59570 

1Q22086735.9 - U223S1 AA203623 AA503337 A1174733 AM92802 C08Q92 AA035357 AI19CS1 9 A1199244 A1828450 AA6Q2Z95 AI378195 
25 AE09170 AI1866S3 AI127795 AM83846 H77389 AJ589465 AA629390 H943C8 AI018388 R68584 AAQ2718& AI745413 AIS85092 

AKI93425 AES23873 AKJ74570 H5C0SS AA047485 N25G60 AA327614 AI042512 AI383957 AA1 56873 AJ333101 N70808 AJ1412&4 
AT383191 AI4O1237AIOar7O9AH034OOW84549T9^ 

AA625146 RKB79 AM 33207 AI132980 AI133214 AB84826 AB81615 AI133473 A1174852 A1133404 AJ133272 V00494 M12523 
M12523 AE07526 AM33120 AH)848Q2 AM74993 AM 1473 A1DS1645 AHJS4716 AI0S4959 H773B8 T857CS AF07538 AM 10799 

30 D17107 N&LG00477 AF190168 R50724 AI248416 AE07432 AI133684 AI133345 AJ174710 A1133280 AM33304 AJ174948 

AE207484 AM10717 AF074624 AI114515 AF0S3516 AM1CS42 AM 14559 A1114498 AJ114759 Ai207568 AJ0S4SS0 AM74753 
AI1146S6 R69184 R0D011 AI0S4397 T6C501 AB07701 T71735 AA385318 H735S9 T604SS H94399 AI133158 T74675 AA484750 
T73413 T55S09 R5Q261 T720S1 N80533 T51183 T7493S AE074S0 AM32925 A10$4701 A1174748AI1146S3AI133104AI132999 
AJ1331C0 A10S4925 AES4979 AI133CS3 AA343347 T69091 AA233989 T39772 AM44620 T5230 D16931 T4C012 T48403 

35 75B926769195 AM330S1 750850 AWC0S77 AHJ91136 AA334608 757411 Z20979N56507T87485AI133622AA343370T4C075 

T69G71 T53849 T74820 AF075316 AI110818 740121 T57381 All 14468 AA332728 151362 AH14589 R0SG91 A1110S29 
AFCS3503 AM40543 AA334661 AA332720 AA343262 T73513 T86549 AI114840 T57284 T39981 TB1407T72757T51749 
75663) AA3431S T72126 R94135 T83028 T39972 T39896 AM74786 AI132326 R03237 AKJS4838 AM32560 TKJ398 788753 
T55930 792126 AJ444602 10093$ AI114792 r©3911 AM331C8 R10779 AI0S5020 T90925 750889 D170aAM337Q3AA333805 

40 AM33040 AM33017 AICS4857 AM 10730 AF074637 AT207557 H71 080 T73217 AA343950 AM74743 AA334224 AA334281 

RCSS92 T64739 T40163 760S28 TB1661 T73179 R01842 AA50173Q T39931 T39662 T40135 AA334904 T71425 H77784 RQQ874 
AtQSStfd 764512 T55918 AI207595 739951 AA005016 T60361 769176 T73356 7587% 761233 T39S55 T6CS12 AM 14676 
AICS4778 AA035710 W52763 AM 14786 T83564 AA341859 T81684 T55769 AI114710 T51776 AA343213 AM 14714 T58102 
AI11C8© R28984AI174854 AA305575AA343592 753836 T46888 764721 755508 W05241 754019 757945 760513 T48364 

45 AR)753CBV\rcS731T82851T48269H54053T 

R02576 795666 ABQ3974 AI189471 AAC05147 AI4781Q2 A1207662 AM92792 AI768421 AICS4737 AW051713 AA93&693 
AI1331 17 AT76S232 AI913646 783SS2 AES51 12 AE07689 AM74684 AI207702 T81475 AM33325 AKB2512 AA701 169 AB3S354 
AI114720 AJ433289 AA046980 AI823482 AM 14536 AA&0S51 AW242644 R07469 AW300438 AH33416 AW271670 AE&1363 
T78943 A1823481 AA845518 AA719124 AA883454 76883) T69115A1935509 AI150977 762890 T71374 768294 AM74774 

50 . 757411 TB8318AI084689 756824 769010 75B9S2 7682)2 AB32829 77^08 AI054819AI205B80 762895769430795111 

AAOK050 T73330 W5857 T71 984 7691 1 8 W92684 AM 1 4860 7B2093 761797 AI522333T73322 H92981 756018 761811 
757232 ABK158 761821 769457 762S00 762912 772917 T4K85 AI702448 757212 757203 R94581 771311 761819 7B9358 
767708 770918 759166 AI1871 11 764308 762071 769427 AM 14750 750430 R09734 769033 769141 T69453 767908 R16809 
769394 AI207729 755839 790273 773339 AW1949Q9 775486 771 850 771305 T71287 753877 T73452 768852 M75230 

55 AJ3128S0 767751 AI174983 751679 754851 H69880 N73734 AA443453 T73466 H69S72 N53B69 76B447 D11809 D12412 

764300 72B321 755864 
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TABLE 4: Figure 4 from BRCA 001 US 



5 Table 4 shows genes upregulated in tumor tissue compared to normal breast tissue. 



Ptey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

10 UnjgenelD: Unfeene number 

UnigeneTife: UnfeenegeneWe 

R1: Ratio of tumor to normal breast fesue 



IS Pkey ExAccn UnigemrtD UnigeneTffle R1 

100113 NM_001269Hs.84746 chromosonieaino^nsafionl 2.3 

100114 X02308 Hs.82962 mymWyiatesynftetase 2S 
100131 D12485 Hs.11951 ectonudeoSde pyrophosphatase^ 1.9 

20 100146 BE185499 Hs2471 K1AA0020 gene product 1.9 

100147 D13666 Hs.138348 ostecirfastspecfeiactor2(f3Scicto 75 

100154 H60720 Ks^1892 KIAA0101 gene product 92 

100163 W44671 Hs.124 geneprecDc^tomcDNAwi^acanp^ 1.6 

100220 AW015534 Hs217493 ajradftA2 2.0 

25 100265 D38521 Ha.1 12396 WAA0077 protein 1.5 

100271 BE160081 Hs256290 S100 catium-Umfing prote&i A1 1 (cafeizzarin) 135 

100275 BE242802 Hs.154797 WAA0090 protein 5.1 

100323 D50920 Hs23106 WAA0130 gene product 19 

100335 AW247529 Hs.6793 plaietefcicth^ tato^ 2.7 

30 100364 NM_004341Rs.154868 cart»noyH)to$phatBsy^^ 2.0 

100372 NMJ014791HS.184339 WAA0175 gene product 2.6 

100393 D84145 K&39913 ncveJRGO^xmteWng protein 32 

100400 AW954324 HsJ5790 phospha&fySnos^glycaaciassC 1.5 

100418 D86976 H&84790 K1AAQ225 protein 2.0 

35 100482 M6502B H&81361 heterogeneous nuctear iiboraideocTOtetoA/B 23 

100518 NM-0D4415H&74316 desmop)a»n{DPI > DPU) 1J9 

100666 105424 Hs.169610 CX)44aji^(h«7ingfura^andmd^ 5.7 

100667 L05424 Hs.169610 CD44anft^iJwn^furicto 9.0 

100668 L05424 Hs.169610 <^ana^(hc^funcfionart 7.6 
40 10067B AW502935 Hs.740 PTK2protekityrosfr» kinase 2 532 

100685 AA32S229 Hs>184582 ribceomai protein 124 1.8 

100630 AA383256 Ks.1657 estrogen receptor 1 1.6 

100783 AF078847 Hs.191356 genetal transection factor IHpolypeptkie2{44|{Da4)uniO 5$ 

100850 AA83S472 Hs297939 caftepshB 17 

45 100892 BE245294 Bs.180789 S164 protein 1.7 

100945 AF0Q2225 Ks.180^6 ubio^ protein IJgaseE3Ai>nn^ 15 

100969 AA157634 Ha.79172 solute carrier tamfiy 25 faitochc^cajto o\3 

100988 AKD00405 Hs.76480 ubiquni>ae4 11.4 

100999 K38765 Kl80706 diaphoiase (NADHrtlADPH) (qrtccrirome b-5 reductase) 1.6 

50 101031 J05070 Hs.151738 niatnxmetaI!oprotBf^9 82 

101045 J05614 gWtumanpictfera^ceflnuc^ 5.0 

101077 N9S692 Hs.75227 EnpHcaty selected frro 23 
101093 106419 Hs.75093 pnxottager^ysine, 2<arxjWa^ 

101161 NNUJ06262H5.37044 penpheiin 16.9 

55 101186 AA02O956 Hs.179881 con>teKfing tactor, betasubunfi ZO 

101216 AA284166 fe84113 <9C*Mfepeno^tfr^ 1A 

101228 AA333387 H&82916 crapenxinccfdarmigTCPI, 1.7 

10/1247 AA132666 rfe.78802 glycogen synumtinase 3 beta U9 

101249 L18964 Hs.1904 protein kinase Q iota 15 

60 101332 J04088 Hs.156346 topoisareraseP<A)B alpha (170kD) 52 

'101332 J04088 Hs.156346 t^xtonerase (DMA) D alpha (170XD) 14 

101352 A1494299 Hs.16297 C^17 (yeast) rwric^cyt^^ 63 

KH396 BE267931 Hs.78995 proffiefafing cefl nuctearanfigen 42 

101445 M212S9 gbHimanAhjnapeafebtheregk^fftD^sinaflmKiearrib 1.9 

65 101470 hiM.000546Hs.1846 tumor protein p53(LWiaimtenisynarane) 1.6 
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101478 NM.002830Hs.758 RASp21 prot^acfirator{GTPaseac9^proto)^ 23 

101483 M24486 Hs.76768 procoCagefvprofine, 2-caoglutatate 4-dkaygenase (pfoBne4^roxytase), alpha polypeptide I 55 

101540 J04977 Hs34981 X^rejatrcompleinBi^ 

101573 AW248421 Hs250758 proteasome (prosome, macropain) 26S sUbuittXA1Pase,3 1.6 

5 101580 NM012151H&83363 coagubfon factor VUJ^ssodatedttobonfclranscripO 5.7 

101592 AF064853 Hs31299 guanine nucteotkte binding protein (G protein), beta polypeptide 2 13 

101592 AP064853 Hs.91299 guanme micteoflde bbwfing protein (G pr 53 

101621 BE391804 Hs.62661 guanytete binc^ protein 1,irrterr^^ 2.4 

101702 AW504089 Ks.179574 proteh phosphatase 2 (formed 2A);ragu!a^ 13 

10 101734 M74099 Hs.147049 cut (DrosjphSaHfce 1 (CCAAT Displacement prrti*) 2.1 

. 101759 M80244 Hs.184601 sc^carrarfc^7(cattaifcaminoa^ 53 

101767 M81057 Hs.180884 carboxypeptidaseBI (fissue) 14.4 

101782 AA306495 Hs.1869 phosphogtucomutase 1 52 

101805 AW409747 Hs.75612 stress^miucecHjhwphoprote 1 (Hsp7G/Hsp90^arfcing protein) 8.6 

IS 101806 AA586894 Hs.112408 S100 caJdurrvttxfcg proieh A7 {psonashl) 83 

101810 WL000318H&180612 peraxison^ membrane piDteh 3 (35kD 32 

101879 AA178374 H&243886 rructearatfoartfgenfc sperm 16 

101911 AA441787 Hs.1 19689 glycoprotein tonnones, alpha polypeptide 313 

101920 AF182645 Hs3024 KtytDldne,dowm^egubk)rofHLAn 13 

20 101973 U41514 Hs30120 UDP4**cetyUlpha4^^ 2.4 

101983 AJ904232 Hs.75323 rjroWbi&) 6-4 

102009 BE245149 Hs32643 protBin tyrosine kinase 9 13 

1Q2036 BE250127 Hs32906 OX^(ceIdMstonc^20.&cerev^ 2.0 

102083 T35901 Hs.75117 hrtBrteuktoertarabbx^ 16 

25 102083 T35901 Hs.75117 trrterkaldn enhancer biraBn^ 13 

102107 BE2588Q2 Hs.182366 heat shock proteh 75 1.4 

102123 NM.001809Hs.15B4 centromere protein A (17*0) 13 

102165 BE313280 Hs.159627 cfeaftassodatedpiDiBh3 4.6 

102198 AW950852 Hs.74598 pdyrnerase (DNA directed), delta 2, regulatory subunit{50kD) 43 

30 102217 AA8S978 Hs301613 JlVlgene 6J 

102220 U24389 Hs.65436 lysosomal 43 

102234 AW163390 H&278554 hetefochrorra&vflte protein 1 13 

102260 AL039104 Hs.159557 karyopheitn alpha 2 rj^cortortl.fir^^ alpha 1) 4.4 

102302 AA306342 Hs39171 proieh ktoaseOfte 2 2.7 

35 102330 BE298063 Hs.77254 chromobcxhijrnotogl (DiosophBa HP1 beta) 13 

102339 BE378432 Hs35577 cyc&Htependentltoe4 23 

102348 U37519 Hs37539 ate*tyctedehy rt qger>ase3fe^ 2.0 

102349 AU077055 Hs289107 bacukmral IAP repeat-contairrfng 2 32 
102369 U39840 Hs299867 hepaiDcyt9nodeartactor3,a!pha 2.0 

40 102374 U33635 Hs3Q572 PTK7prc^ tyrosine Mnase 7 62 

102391 AA296874 Hs.77494 deoxyguanosirte kinase 13 

102455 U4S705 Hs.75562 c5s«^A»i^ receptor fenfly. memberl 63 

102465 NMJB01359Hs31548 2»44fenoyl CoA reductase 1, rrtfochondrfal 13 

102488 U50939 Hs.61828 amytoid beta precursor prcteMhd^ poteh 1. 59kD 13 

45 102489 AL080116 Hs.74420 on^recognrtacanpfex,si4iJnft3r^^ 33 

102494 AI188137 Hs.75193 COP9 hornctog 2.1 

102501 AF217197 Hs.74562 siahrjha^pr^1;FBPirdBtaco^ 32 

102S22 BE250944 Hs.183556 solute carrier femfty 1 (neutral arrtoaiHtransporterX 23 

102532 AF04G253 Hs.70186 suppressor cOyP^aevisiae) 5 homolog 5.7 

50 102564 U59423 Hs.79067 (^(rnolrm against c^caperitap^ 23 

102568 W81489 Hs223025 RAB31,mernberRAS oncogene lairfly 53 

102580 U60808 K&152981 COP^fiacyfelycerisyn^ 21 

102581 AU077228 Ks.77256 entanr»ofzBste(Dioso^ 13 

102582 U61232 Hs32675 oibufin-spec&crtaperonee 21 
55 102617 AW161453 Hs.198767 <X)P9 (axtsSuto ptoton^ 13 

1Q2618 A1037672 Hs31071 exbaoeButarmalrteprotetol 5.8 

1026Z7 A1321918 Hs.158174 zixfina^ptoteh 184 (KruppeHfte) 13 

102663 NMJOQ2Z70Hs*168075 karyopherto flmporfln) beta 2 13 

102676 BE262989 Hs.12045 pitedtoprriBh 23 

60 102687 NM_007019Hs330Q2 ubsjuffin canfer protehE2-C 43 

102689 U96132 Hs.171280 hyotacyacyUfcenzymeAda 63 

102696 BB40274 Hs239 foftheadbaxMI 42 

102704 AU07706B Hs34089 BRCA1 associated RING domain 1 13 

102705 T97490 Hs300Q2 snTalhdudbtecytotfries^ 23 
65 102750 AB014460 Ks36196 nm^xo8endomicteaseDI>fike1 12 

102801 BE252241 Hs38041 pyridoxal torfdmdne. vibrato B6) kinase 6.4 

102812 U90549 Hs236774 WglMnobiBty group {nonhistone chrorrosornal) protein 17-fike 3 1.6 
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TAMJE 3 A 

Table 3A shows the accession numbers for those pkeys lacking unigenelD's for Table 3. For 
each probeset, we have listed the gene cluster number from which the oligonucleotides were 
5 designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" column. 

10 Ptey. UnquaEostMcbesdtden&fierrajmber 
CAT number Gena duster rcrmbsr 
Accession? Genbank accession numbers 

IS Pttey CAT number Accessions 

111168 3B585J A1798^S464MAW811617AVW11616W 

AA05555S AW858231 AW857541 AW814172 H66214 AW814398 AF134164 AA243093 AA173345 AA1 99942 AA223384 
AA227092 AA227080 T12379 AA092174 761 139 AA149776 AA899829 AW879188 AW813557 AW813538 AI267158 AA157718 
20 AA157719 AA100472 AA1C0774 AA130756AA157705 AA157730 AA157715 AA053524 AW849581 AW854566 C05254 

AW882836 T92637 AW812621 AA2DS583 AAHEJ204 BE15S909 AA226824 A1829309 AW991957 KSS951 AA527374 H56215 
AA045564 AI6942S5 B60808 AA149726 AW195620 BE081333 BHJ73424 AW817662 AW817705 AW817703 AW817659 
BE081531 H5K70 

1022086735 9 U22SS1 AA203S23 AA503337 AM74733 All 92802 00882 AA035357 AI1S0S1 9 A1199244 AI828450 AA6Q2298 AJ3781^ 
25 " AI209170 AI186653 A1127795 AM 83846 H77389 AK89465 AA629390 H943C8 A1018388 R68584 AAQ271SS AI745413 AI585092 

AI093428 AJ623873 AKJ74570 N5Q098 AA047486 N25060 AA327614 AHW2512 AI383957 AA156873 A1333101 N7080S AI141254 
AB83191 AM01237AIQ80709At093400 VW4549 790808 R0C012WM413AA630557AB 

AA625146 R68379 A1133207 AI132980A1 133214 AI0S4826 AI051615 A1133473 AJ174852 AI133404 AI133272 V00494 M12523 
M12523 ABQ7526 AI133120 AT064802 AH74993 AH147a A1081645 A1064716 A1084959 H77388 T85708 AF075298 AI110799 
30 D17107 N&U00477 AF190168 R50724 A1248416 A1207432 AM33684 AI133345 AM74710 A1133290 A1133^)4 AI174948 

A^484AI110717AIW4624AI114515AF0S3516AI11<^ 

AI114686 R69184 R00011 AICS4997 T6Q5M At207701 T71735 AA385318 H73569 760498 H94339 All 33158 T74675 AA484750 
773413 T59909 R50261 772)61 M80533 751189 T7493S AJ2074S0 A1132925 A1CS4701 AI174748 AH14663 AJ133104 AH32999 
AI1331TO A1084a25AN84979Att33063AA343347 768091 AA23^ 

35 758926 769195 AH33051 750850 AM00877AI091 136 AA334608 75741 1Z2D979N56507 787485 AM33622AA343370 T40075 

T69671 T53849 T74820 AF075316 AM 10818 T40121 757381 All 14468 AA332728 T51362 AM 14589 R08S91 AI11C829 
AF083503 AI140543 AA334661 AA33Z720 AA343262 T73513 TB6549 AI114840 757284 739981 T61407 T72757 751749 
756630 AA343125 T72126 R94135 783028 T3S972 T39895 AI174786 AI13326 R09237 AKJ84838 AM33660 760398 TB8753 
755930 T92126 AW44602 7609SS AM 14792 *B391 1 AM3310S R10779 AKJ65Q20 7S09S 750889 D17Q3 AJ133703 AA333805 

40 AI133OC0 AM33017 AI084857 AM10730 AP074637 A12I75S7 H71080 T73217 AA343950 AI174743 AA334224 AA334281 

RGS692 764739 T40163 760628 731661 773179 R01842 AA501730 739931 T39562 T40136 AA334S04 771425 H77784 R00874 
AKJ55049 T84512 755918 AI207595 739951 AAD05016 760361 769176 773356 T58795 T61233 T39955 760812 AI114676 
AICS4778 AA035710 W52763 AM 14786 T835S4 AA341859 T81684 T55769 AM 14710 T51776 AA343213 AM 14714 T581D2 
AI11Q809 R28984 AI174854 AA305675 AA343532 T53B36T46869 764721 755508 W05241 754019 757945 T60513 T48364 

45 AHJ75308W8S731 T82851 T48289H54053 773211 AM145S0 T48317 T55SS5 T74857 R84226T58552 752231 774946 T76976 

R02576 T95666 A&03974 AM 89471 AA005147 AI4781Q2 A1207662 AM92792 AI768421 ANS4737 AW051713 AA936693 
A11331 17 AI76S232 AB13646 T839S2 AICS5112AK77689 AM74684 AI207702 T81475 AI133325 AKBS12 AA701 169 AB36354 
AI114720 AJ433289 AA046980 AE823482 Ah 14536 AA880S51 AW242644 R07469 AW3C0438 AM 3341 6 AW271670 A^1363 
778943 A1823481 AA845518 AA719124 AA883454 768850 769115 A1935509 AM 50977 771374 7K294 A1174774 

50 . W41176B318Aro84689756624769010T68982768302Al33282977^At^ 

AA025050 T73330 W52657 T71984 769118 W92684 AM 14860 T62093T61797AI522333T73322 H9S81 756018761811 
757232 AB38158 761821 769457 76290076312772917 T46885 AI702448 757212757203 R94581 T71311 T61819 789358 
76770 770918 759166 AI187111 T64&8 T62071 769427 AM14750 760430 R09734 T69033 769141 T69453 767908 R16809 
769394 AI207729 T55839 T90273 T73339 AW1 94909 775486 T71 850 T71 305 771287 T53877T73452T68852 N75290 

55 AB12890 T67751 AI174983 T51679 T54851 HB9880 N73734 AA443453 7734® K89872 N53B69 T68447 D11809 D12412 

T643C0T2B321 755864 
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TABLE 4: Figure 4 from BRCA 001 US 
5 Table 4 shows genes upregulated in tumor tissue compared to normal breast tissue. 



Pkey. Unique Eos probesetkterrtffier number 

ExAccn: Exemplar Accession number, GenbanJcatxsssbn number 

10 UnfeenelD: UnJgene number 

UnjgeneTffle: Urugenegenefflte 

R1: Rafoof tumor to normal breast tissue 



IS Pkey ExAccn UnigenelD UmgeneTIHe 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



00113 
100114 
00131 
00146 
00147 
100154 
00163 
00220 
00265 
00271 
100275 
100323 
100335 
100364 
100372 
100333 
100400 
100418 
100482 
00518 
00666 
00667 
00668 
00676 
00(35 
100690 
100783 
100850 
100832 
100945 



00988 



101031 
01045 
01077 
[01033 
101161 
01186 
01216 
01228 
101247 
101249 
101332 
01332 
01352 
01396 
01445 
01470 



NM_001269Hs34746 
XQ2308 Hs.82962 
D12485 Hs.11951 
BE185499 r&2471 
D13666 Hs.136348 
K60720 Hs31892 
W44671 Hs.124 
AW015534 Hs^17493 
D38521 Hs.1 12396 
BE160081 H&256290 
BE2428Q2 Hs.154797 
D50920 Hs23106 
AW247529 Hs.6793 
NM_004341Hs.154868 
NMJD14791HS.184339 
D84145 Hs39913 
AW954324 Hs.75790 
D86978 Hs.84790 
M65028 Hs31361 
NKL004415H3.74316 
L05424 Hs.169610 
105424 Hs.169610 
L05424 Hs.169610 
AW502935 Hs.740 
AA328229 Hs.184582 
AA363256 Hs.1657 
APD78847 Hs.191356 
AA836472 H&297939 
BE245294 Hs.180789 
AF0G2225 Ha.180685 
AA157634 Hs.79172 
AKD00405 Hs.76480 
K38765 Hs30706 
J05070 Hs.151738 
J05614 

N99692 Hs.75227 
L06419 Hs.75093 
NMJ»6262Hs37044 
AA020956 Hs. 179881 
AA2841G6 H&84113 
AA333387 Hs32916 
AA132666 Hs.78802 



chromosome ccnaensafion 1 
thymkfylafe synthetase 
ectoriucJeolktepyrcftospr^^ 
WAA002Q gene pfooud 
()siBottastspecfofect^ 
K1AA0101 gene product 

gene predicted tan cDNA wifo a conplete caflrig seajience 
aimeodhA2 
KIAAQ077 protein 

S1 00 (aWurn-bWing protein A1 1 (cajgizzarin) 
WAA0090 protein 
WAAD130 gene product 
ptaiefc*adrvating factor acetyihydrobse, Isota 
carbainoyHptophato synrnela» 
WAA0175gene product 
wj^RGOcontamtog protein 
pnosphafidySnos&ri glycan, class C 
K1AA0225 protein 

heterogeneous miclearnl>onucleopro!emA/B 
desmopbWn(DP»,DPH) 

CD44 anfigen (honing function and Indian blood group system) 
CD44anflgen(rtc^furtc8c*a^ 
CD44 anfigen (homing luncfion and maTanbtood group system) 
PTK2 protefci tyrosme kinase 2 
ribosomaJ protein L24 
estrogen receptor 1 
gerieraltransCTtptobc^ 
cafhepsinB 
S164 protein 

ubiquffin protein Dgase E3A (human paptBoma vkus E6-assoaated protein, Angelman syndrome) 
solute canter fiamDy%(rna^^ 
ubiqu^n-ln\e 4 

dlaphotase (MADH/NADPH) (cytochrorrieb^ieAicta» 
matrix nwtaJbpfotelnase 9 (g^ 
gbttun^ptolferBfirgceln^ 
Envticafiy seteded fomAFFX stogbprobeset 
procoBageVlystrie, 2-ocogJutan^ 



R1 

23 
23 
1.9 
15 
75 
92 
1.6 
23 
15 
135 
5.1 
13 
2.7 
23 

25 

32 
15 
2.0 
23 
13 
5.7 
9.0 
75 
532 
1.8 
1.6 
53 
1.7 

I. 7 
15 
6.3 

II. 4 
1.6 
82 
53 
23 



L18964 
J04088 
J04088 
AM94299 
BE267931 
M212S9 

NMJD00546HS.1846 



Hs.1904 
Hs.156346 
Hs.156346 
Hs.16297 
Hs.78996 



peripherin 
core-binding factor, beta subunft 
cycirHJeperdentWnase&^r^ 
cteperoriinamtairrfrgT^ 
glycogen synthase kinase 3 beta 
protein kinase C tote 
topoeomerase (DNA) B alpha (170kD) 
topofsornerase (DNA) D alpha (170kD) 
COX17 (yeast) hwijo^cytochroineca 
proliferating ceflrrudeararrijgen 
gteHuman Atu repeats fat fte region S to toe small nuclear rib 
tumor protein p53 (U-Fraumenl syrxtrome) 
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13 
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101478 NM_002890HsJ58 RAS p21 proteh acfivator(GTPaseac8vaftig protein) 1 23 

101483 M24485 Hs.76768 procdiagervprc^ 2<K^^ 53 
101540 J04977 Hs34981 X^rej^compJemanflrgdefe^ 

101573 AW248421 Hs250758 pnjteasome (prosome, macropain) 26S subunR,A"IPase,3 1-6 

5 101580 NML012151Hs33363 coagulation fc**orVin-a^^ 5.7 

101592 AF034853 H&91299 guanine nudeoikte Wrafirtg protete (G protein), bete polypeptide 2 1.8 

101592 AF064853 Hs.91299 guanine nucleotide binding protein (G pr 53 

101621 BE391804 Hs32661 guanylate bincfing protein 1 , interfenxvtatocfofe, 67kD 2.4 

101702 AW504089 Ks.179574 pioteOTprK>sptertase2(forTne^ 13 

10 101734 M74099 Hs.147049 cut (DrosophSaHike 1 (CCAAT tispJacerrK^ protein) 2.1 

101759 M80244 Hs.184601 solute canierfeirruly 7 {cato 53 

101767 M81057 Hs.180884 carboocypepfidaseBI (fissue) 14.4 

101782 AA306495 Hs.1869 phosphoghicomutase 1 5 -2 

101805 AW409747 Hs.75612 slress4ndiK3ed-phosphoprDtein 1 (Hsp707Hsp90-ofgantang protein) 8.6 

IS 101806 AA586894 Hs.112408 S100 c^ciunvtofirig protein A7 (psorfastnl) 83 

101810 NMJ000318H&180612 peroxisomal membrane protein 3 (35kD,2eBweger syndrome) 3^ 

101879 AA176374 H&243886 nuctoautoan^enic sperm prrteto (Nstone-Nndfog) 1.6 

101911 AA441787 Hs.1 19689 glycoprotein honia^ at^ porypeptlde 313 

101920 AF182645 Hs3G24 IK cytokine. oYwn^egu^ 1-8 

20 101973 U41514 Hs30120 UDP-F^acetyWpha^^ 2.4 

101983 AI904232 Hs.75323 prohMn 8.4 

102009 BE245149 Hs32643 protein tyrosine kinase 9 13 

102036 BE250127 Hs32906 CDC20(ce9 dwstoncycte20, S. cerevisiae, homotog) 2.0 

102083 T35901 HsJ5117 bitertajkmertoarrablnc^ 1.6 

25 102083 T35901 Hs.75117 interteukin erihajxx*binc%Kjfector2,4 13 

102107 BE258602 Hs.182366 neat shock proteto 75 1-4 

102123 NfvL001809Hs.1594 centromere protein A (17kD) 13 

102165 BE313280 Hs.159627 death associated protein 3 4.6 

102198 AW950852 Hs.74598 pdyniejasetDNAaTrectea^a^ 43 

30 102217 AA829978 Hs301613 JTV1 gene 6.7 

102220 U24389 Hs.65436 lysosomal 43 

102234 AW163390 Hs27B554 hetefochiornafin^ protein 1 13 

102260 AL039104 Hs.159557 karyci)h8ffoatpta2(RAGcch^ 4.4 

102302 AA306342 Hs39171 protete kinase Wke 2 2-7 

35 102330 BE298063 Hs.77254 chfoniobDX homotog 1 (Drosophfla HP1 beta) 13 

102339 BE378432 Hs.95577 (independent kinase 4 23 

102348 U37519 Hs37539 aickriyde otayrtoger^ 2.0 

102349 AU077055 Hs^89107 bacubviraJ IAP repeakantaJnteg 2 33 
102369 U39840 H&299867 bepatDcytenudear factor 3, alpha 2.0 

40 102374 U33635 Hs.90572 PW protein tyrosro kinase 7 63 

102391 AA296874 Hs.77494 deoxyguanostee kinase 13 

102455 U48706 Hs.75562 olscok^Axr^ receptor tarrary, nier^ 63 

102465 NH.001359Hs31548 2,4dieiKiylCoAieAiclase1 > m%xiton^ 13 

102488 U50939 Hs.61828 airrytoxl bete precursor rproteWrf^ 13 

45 102489 AL080116 Hs.74420 a^iecogrtftaran^ suburb ^ 33 

102494 AI188137 Hs.75193 COP9homolog 2.1 

102501 AF217197 Hs.74562 siah blra^ protein 1; FBP interact^ 33 

102522 BE250944 Hs.183556 solute canier lairtyl (neutral amino atf 2.8 

102532 AP040253 Hs.70186 suppressor cfTy$xercv^ 5.7 

50 102564 U59423 Hs.79067 MAD (moftiers aga^decaperrtaptegte, DrosophBa) homotog 1 23 

102568 W81489 Ha 723025 RAB31,men*er RASorwogenefamOy 53 

102580 U80808 Hs.152981 COP-<Sacylglycerdsynm^ 2.1 

102581 AU077228 Ks.77256 enhancer of zeste(Dro»prfo^ 13 

102582 U61232 Hs32675 bibufevspecfcchapefonee 2.1 
55 102617 AW161453 Hs.198767 COP9 (oxis«uOve photornc^ harokxj) suburb 13 

1Q2618 AL037672 Hs31071 extrace&itaf matrix protein 1 53 

102627 A1B21918 Hs.158174 ztec8ria^piDteki1840<n^peUke) 13 

102663 NM.002Z70H&168075 kafyopheite(knpoftin)b8ta2 13 

102676 BE262989 Hs.12045 putato protein 23 

60 102687 NM.007019HS33002 libknifromter protein E2-C 43 

102689 U98132 Vte.171280 rry*DxyacyK*eru7rr)e^ 63 

102696 BE540274 Hs339 forkhead box Ml 43 

102704 AU077058 Ha34089 BRCA1 associated RING domain 1 13 

102705 T97490 Hs300Q2 soialMjcbtecytokmesuD^ 23 
65 102750 AB014460 Hs.66196 nlh^xo«enA)niicteaseni>fike1 1-2 

102801 BE252241 Hs38041 pyrktaxal {pyrWaxtee, vftamh B6) kinase 6.4 

102812 U90549 Hs336774 hfc!wnoi>IIityp^(r^ 13 

115 



WO 02/059377 



PCT/US02/02242 



6.3 
8.8 
3.0 
1JB 
5.6 



1028Z7 BE244588 Hsj6456 chapaxH*a)ntabkigTCPl,3iittK*2(beta) 

102831 AA262170 H&8Q917 adaptor-related protein complex 3, sigma 1 sutomtt ri 

I02844 AV653790 Hs.324275 WW domakwonlaWng protein 1 ^ 

102888 X02419 Hs.77274 pfasmhogen activator, urokinase * 

IQ2925 B&440142 H&2943 signal ceoognifionpaiflcte19kD 1~ 
IQ2S35 BB61850 Hs30506 smalrwctearnTo^^ «._^>u^Wai« r^h^rrtLoi 
[02968 AU076611 Hs.154672 mefoylenetetr^^ 

102983 BE387202 Hs.118638 notHnrfastateceBsl, protein (NM23A) expressedto *] 

102385 U95742 Hs2707 G1 to S phase transffionl . 71 

103023 AW500470 Hs. 117950 muffifurcfloi^ pol^^ ]* 

103038 AA926960 Hs334883 COC28 protein kinase 1 J? 
03060 NMJW5940HS.155324 matrix metaloproteinase 11 (K*^1;storner/sln3) 

103080 AU077231 Hs32932 cydinDl pRAMrparalh^ adenomatosis 1) *■ 
103089 D31152 Hs.179729 wtagen.trpeX.afrha 1 (SdnnU metaphyseal 

103177 BE244377 Hs.48876 femesyklptosptetefa^^ * 

103178 AA2Q5475 HsJ75865 rfbosomal protein S18 . ... . To 

103179 NM_001777Hs^2685 CD47 anfigw (Rh-rebted an^en, htegn^assoca^s^ transducer) J* 

103181 X69636 Hs334731 Hon»sap^cbr»IMAGE3448306,ntf^partolcds ^ ^ £J 

103185 NML006825HS.74368 transnen*)raneprcteh(6^ I] 

103191 AA4O1039 HSw2903 pra^phc€pra^4(ftxw)yX),cateJyfcsub^ ^ r? . 

103193 NM_004766Hs.75724 roatomBrproteh complex, subunabefa2(betaprfrne) « 

103194 WAJ004939HS.78580 DEAW(Asp-G!u^la^spAfis)^ 
103206 X72755 Hs.77367 mcocHnefriduo9dbyganmBlrtoteron 
[03223 BE275607 Hs.1708 chapaonin coniaWng 
J03232 X75962 Hs. 129780 tumor necrosis factor ieoeptDrsuperfan^,meinber4 
103238 AI369285 Hs.75189 deaflwsstxfeted protein 

103297 NM_001545Hs.9078 branatue colon cairirorratransai^ ■ 

[03330 AI803447 Hs.77495 smal nuclear ribonudetjp^ fx 

J03349 X89059 gbiisapiensii^tountawwnD^ J* 

103376 ALQ36166 Hs323378 coated veskfe membrane p^otBh . . , J* 

103391 X94453 Hs. 114366 pyrotoe^aAoxylate ff 

103392 X94563 g&^aplensdbifatxvp^exonl 82. *™ 
[03430 BE564090 H&20716 trarsJocasetf irmmitoc^ I~ 
103491 AF264750 Hsl288971 rnyefoio7!yrrg>r^ J~[ 

[03505 AL031224 Hs331Q2 transcn^ tactor AP-2 beM , . - q2 

103547 AI376722 Ha.1800S2 proteasome (ptosonie. rnac^ 

103588 NM-006218HS.85701 phosphokmsffiM^anase, calalyb^ alpha polypepflde ^ 

103613 NMJ»0346H3^316 SRY(sex*iterninIngregta ™ 

103621 BE379766 Ks.150675 pclyrn8iase(RNA)U(DNAdrecte^ tz 

103622 AA6Q9685 H&278672 rnembiane oofrqwwnt chronosore 11, surface inarkeM *-f 
IQ3727 A1878883 Hs-296381 growfe factor receptor-bound pcoten 2 

103749 AL135301 Hs*768 hypofceflcal proa* FU10849 ril/c ^ ()el(mjl ]~ 

103754 AI015709 Hs.172089 Homo sapiens ntf^cDNAWCF^^ « 
[03780 AA094752 Hs.169992 hypdheScaI43^KdproJeki 

IQ3795 H26531 Hs.7367 Hc*rosapkraBTB*rahpn^(BDPqr^ ^ ^ J* 

I03797 AA080912 gbczn04dD3.r1 Sbafc^enehNT neuron (937233) rtonosapierficONA clone ysnrtbr w 

103813 AI042582 Hs.181271 CGI-120 protein J* 

103855 WQ2363 Ha302267 hypoheficalpnjtefo W10330 J? 

I03886 AKD01278 Hs.105737 hypothefcaJ proteh FU1M^ 

104052 NKL002407Hs^7644 inaramagtabki 2 7T 

104079 AA251242 Hs.103238 ESTs J! 

104174 AA478984 Ha5451 PR0659 proteh °'° R 

104227 AB002343 Hs^8938 protocadherh alpha 9 , ^ 

104275 AI751970 Ha.101067 GCK5(p^ne!ala)ntrdoJainmo^ J;Z 

I043K BE379766 Hs.150675 pdynwase^ ^ 

I04370 AA324597 Ha21B51 Hontt>sap^c£)NAFU12900f^ctor«hrra^^^ ^ 
104423 R83113 Hs.1432 protein toiase C substrate 80K4i 

I04482 AB037762 Hs.44268 myefinc^ expression tactor 2 ^ 

104532 AM98763 Hs503013 nypothefica! protein FU1Z748 f-J 

I04563 AL1 17403 HsJ506189 DKFZP434F1735proteki J-f 
I04667 AC39923 H&30098 ESTs 

(04757 AI694413 Hs332649 otfadory receptor, tamfiy 2, suWam^ ^ 

I04804 AI8587Q2 H&31803 ESTs, WeaWy simQar to N-WASP [Usapiens] ^ 
104806 ABQ23175 Hs£2982 KIAA0958 protein 

104827 AW052D06 HsA551 PRP4/STK/WD splicing factor ^ 

[04846 A1250789 HsJ2478 ESTs 

[04854 AA041276 Hs.154729 3^)TK)Sphoir«sifetedeperdert 
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04867 AAZ78698 H&225979 hypothetical protein sknBar to smaDG proteins, especially RAP-2A 2.0 

04871 T78044 H&28893 Hcroo sapiens rrtfWcONADK^ 13 

04896 AW015318 HsJB165 ESTs 17 7 

04909 AW408164 Hs£491B4 transcription factor 19 (SCI) 5.0 

04916 AW958157 Hs.155489 NSI-assockted proteh 1 17 

104919 AA026880 Hs^5252 prdacfin receptor 1*4 

04930 AF043467 Hs32893 iteurexophOn2 ^ 

04973 NM.015310H&6763 WAAQ942 protein 5.0 

104974 Y12059 H&278675 bnxraxk)makHX)ntainin94 1.4 
[04975 AL136877 Hs30758 S*C4 (stnxtoral nafote^ 2.4 

104975 AL136877 H&50758 SMCM(stru*ira]mahtenancetf 23 

04978 AI199268 Hs.19322 Homo sapiens, SimBar to RDXEN cDNA 2010317E24 gene, done 1MAGE35G2019, mRNA, parfed cds 72 

04979 AA937934 Hs321062 ESTs 1 -3 
04994 M499930 Hs334885 mBodBntlrial GTP binding protein 33 
05009 BE379584 H&34789 (toflchyWjjtosphoctf^^ 5 -5 
05012 AF098158 Hs3329 chromosonie 20 openieaolng frame 1 33 
05026 AI050715 Hs^331 E2F transcription factor 5, p130-birurlng 22 
05032 AA127818 gfczH2a02s1 Sc^jiregnariLutenisJftHPU Homo sapiens cDNA clone IMAG&501674 y 6.8 
05039 AA907305 H&36475 ESTs 25 
05041 AB037716 Hs.26204 KEAA1295 protein 22 
05045 BE242899 Hs.129951 specMB^POZ proteh 33 
05079 AA151342 Hs.12677 OSM47 protein 93 

05087 AA147884 Hs3812 Homo salens cDNA RJ14388 fts, clone HEMBA1002716 5.6 

05088 H58589 Hs35156 Homo sapiens cDNA FU1 1027 fis, clone PLACE10041 14 22 
05095 Z78407 H&Z7023 ve^tanspoiMelatedprotebt 22 
05110 BE387350 Hs33122 WAA1 160 protein 1-6 

05126 AW975433 Hs36288 ESTs 6 - 3 

05127 AA045648 K&301957 ruKfo(ra)cteaside(8phosr^ 2.1 
05141 AA164687 Hs.177576 mamc^ (a^ Ws*ycco^ 2.7 
105158 AW976357 HsJ234545 hypoftafical protein NUF2R 1-9 
05169 BE245294 Hs.180789 S164 proteh 1-7 
05186 AA191512 H&28005 Homo saptens cDMA FU 113096s, clone PLACE1010076 43 
05254 AA071276 Hs.19469 WAA0859 protein 1-9 
05281 AA263143 K&44596 RADSWnteracBng r^rotem 23 
I05288 N99673 Hs3585 ESTs, Weakly sfcnBarto AF126743 1 DNAJ dcmaln-containkig protein MCJ [RsapiansJ 13 
05302 AA700122 Hs3355 serdrm-specfc protease 8*0 
05331 AW270037 Hs.179507 K1AA0779 protein 1* 
105359 NMJM6015Hs3054 CGK68 protein 82 
105366 BE264645 Hs262093 hypometfcal protein FU21918 5.0 
05373 AW887701 Hs.32356 hypoBiefea) proteh FU2062B 23 
105374 BE242803 Hs^62823 hypoftefcal proteh RJ10326 22 
I05387 AW592146 Hs.108636 membrane protein CHI 23 
105393 AF167570 H&256583 mterteu)teertocerbhdbigfacter3.^ 5.4 

105399 BE386877 K&334811 tyw3M>iK^ proteh NpwBP 13 

105400 AF198620 Hs.65648 RMbWirgrrKtf protein 8A 13 
05445 AA252395 gfczs1^r10.s1 NCLCGAPJSCB1 Homo sapiens cDNA clone IMAGE585026 3\ mRNA sequence. 5.0 
105507 BE268348 H&226318 CCR4-NOTtransc^coin^ 13 
105529 AA113449 Hs32471 hypothetical proteh FU2D364 1-3 
05530 AB023179 Hs3059 K1AA0962 protein 3 4 
105547 AA262640 Hs^7445 unknown 9 - 3 
105564 BE616694 H&288042 hypo&ieScal proteh FU14299 1-4 
105596 AA579535 Hs.18490 hypomeSca) proteh HJ2D452 103 
05597 AF054284 Hs334826 spiting factor 3b, suUmU155H> 23 
05608 AI6082O1 r&287863 hypo th etical proteh FU12475 1.7 
05610 AA280072 Hs39S72 fetal AkheBner antigen 14 
05617 AMB00892 Hs.4069 pJucccotfroWmod^^ 1-7 
103620 AW3Q2245 Ks.181390 rasehHnase1,#mvna2 53 
03658 AA985190 H&246875 hypc*e8cal proteh RJ20059 9.4 
05697 AW499988 Hs^7801 zinc finger proteh 278 23 
03708 R26944 Hs.180777 Hoino saptens mRNA; cONADKFZp564M02Mf^^ 1.7 
05743 BE2465Q2 Hs3598 senia fomain, lnimur»g^^ 2.6 
103746 AW151952 Hs.46679 hypothetic* proteh FU2D739 13 
105759 AI123118 Hs.15159 chernc^he^factcf.alterna 1.3 
103771 AB67720 Hs.153221 syiiovlalsarcaia,trarBlocatedto 1-6 
I03820 AA741336 Hs.152108 transactional unB N143 22 
05826 AM78756 Hs.194477 E3irokroi&>f&asBSMURF2 1-3 
03856 AI262106 Hs.12653 ESTs ZA 
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105858 AF151056 H&281428 hypotofical protein 25 
105875 AKD01708 Hs32271 hypotheftal protefci FU10846 

105930 AF016371 Hs3880 pepftiyi prolyl (somBraseHtcydophDnH) ^ 

IO600O AW194426 Hsl2C726 ESTs _ _ I*' 

106011 AW0812Q2 Hs.12284 Homo saptow, done BMG&29B9555, mRNA, partial cds 2£ 

106017 AM77956 Hs^6268 ESTs J* 

£ MOT toSS S^S^ ALU SUBFAMILY SX SEQUENCE CONTAMINATION 1 £ 

106094 AA533491 Hs23317 hypoftiefiCBl protein HJ146B1 J* 

]06140 AD006624 hfe.14912 WAA0286 protein 1 -° 
106271 AA251393 Hs^89052 Homo sapiens. Stmiarto R1KEN cDNA 5430429M05 gene, done MGC:13155, mRNA, compleiB ctb 10.8 

106288 AB037742 H&24336 WAA1321 protan J* 

106300 Y10043 Hs.19tt4 hfcMnob^ group (ronMs^ » 

106333 ALD43114 Hs22410 ESTs, Weakry sfrnBarto A54B49 colagen alpha 1(V1I) chain precursor [H^apfens] 5.4 

106350 AKD01404 Hs.194698 cycfinB2 *.7 

I06339 AW390282 H&31130 transmernbfane7sur^&iar^ « 

(06381 ABO40916 Hs24106 MAA1483 protein ^ 

I06389 AW748420 Hs5236 Homo sapiens cDNA:aJ21487fis i cte»COl05419 ^ 

I06457 AF119256 HsJZ7801 zinc finger protein 278 

106470 D63078 Hs.186180 Homo sapiens cDNA:rXJ23038fis i dor»UK30^ f-f 

I0S531 AA454036 H&8832 ESTs ]* 

106586 AA243837 Hs57787 ESTs ]~. 

106589 AKD00933 Ha28681 Homo sapiens cDNA RJ10071 fis, done HEMBA1001702 ^ 

[06610 AA458882 Hs.79732 ffl)u8n1 \% 
1 06624 NM 003595H&26350 tyrosytprotein s uHu tian3ferase2 

,06650 AL049951 H*22370 Homo sapiens n^dX*DKFZp6^40W2 . , \\ 

106669 AV657117 Hs/184164 ESTs.Moo^ratelysfeitoteS65657a^1(>aAen^ ]-3 

106713 BE6148Q2 Hs.184352 hypothec protein FU12549 J* 
106717 AA600357 H&239489 71A1 cytotorc granule-associated RNA-broJr)g piotem 
108723 BE388094 H&21857 ESTs 

106795 AF174487 Hs^93753 Bc^rofatedovanwktoproteln^e ™ 
I06829 AW959893 Ha27099 tiypofoe&al proter FU23293 srir^ 

i06831 BE564871 H&29463 centrin, EF-tod protein, 3 (CDC31 yeast homobg) 1-5 

[06846 AB037744 H&34892 WAA1323 protein ~ 

106852 AF151031 rfc300631 hypofteficalprotetn J-* 
I06873 N49809 Hs.11197 Homo sapiens, done MAGE3343149. mRNA.par8alcds 

106886 W79171 HsS567 GUXEproteso J* 

1 06906 AA861271 Hs222Q24 toanscripttontactorBMAtf ft 

106920 AK001838 H&296323 senjnV&h«>corBooid regulated kinase f* 
[06945 AKD00511 Ha.6294 hypofte^pn^DKFZp434l1^ 

I06973 BE156256 Hs.11923 hypometical proteSti ™ 

106977 AL043152 Hs£0421 WAA02Q3 gene product 

106978 AW631480 H&8688 ESTs ™ 
107004 AA146872 Hs3O070O typoftefical protein FU20727 / 

^07029 AF264750 Hs3B8971 rmjitoWlyinphc^^ *f ]* 

107071 AW385224 H135198 edonudeob^ pyrophosphate^ y 

107113 AKD00733 H&23900 GTPaseacfiwfing protein 23 

107125 AKD00512 Hjl69388 riypomefcal proteh FU20505 ]*{ 

107136 AV661958 Hs£207 GKD01 protein J* 

107136 AV661958 Hs£207 GKD01 protein 

107146 AK001455 H&5198 Dovnsyndionnafficalro9tongene2 

107151 AW378065 tisSXSI ESTs 

107155 AW391927 HsJ946 WAA1288 protem 

107174 BE12Z762 Ha^5338 ESTs « 

107197 W15477 H&64639 gtoma pamogenes&related protein «•» 

107221 AW888411 Hjl81915 leufcerrfa-assodated phosptoproteto p18 (statotn) _^^ nl . . l f * 

107243 BE219716 Hsi4727 ESTs. Moderately sirto to I387» 7.4 

107248 AW263124 Ha315111 nucto receptor anepiesson^^ J* 

I07263 D60341 H&21198 translocase of outer trfiochortAtairne^^ _ Mt ^ 

S94 rteT9136 ECTs.M«te»at* 

107298 N95657 Hs^20 ESTs.Moo^ia^sfcnaartoYOLnjCAmrm^^ 2-5 

107298 N95657 H&J6820 ESTs,Mc)deratetystntotDYOJ1_C^EELH 1-7 

107299 BE277457 Hs^661 r^potheflcalpn>tBhMGC4606 iyr , JflcwwoJU 

107316 T63174 Hs.193700 HomosapteiistnRNA;cDM 2.0 

I07354 ^006299Hs56448 zinc finger protein 193 J.0 
107392 AW299900 Hs^67632 TATAelememinoAitatoryfia^l 

107481 AA307703 Hs^TSTK kmesm femfly member 4A 15 
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107529 BE515065 Hs^96585 nucleolar protein {KKE/D repeat) 3-0 
1A7CCJ AAfl013fi8 H&59S44 ESTs 

07681 £K S ESTs, Moderately simOar to AlJLF7JiUM^ ALU SUE^AA/HLY SQ SEQUENCE CONTAMINATION 12 
107772 AA01B587 HsJ03055 ESTs, WeaWy sMbr to ALU1J1UMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 2.1 

107859 AW732573 Hs.47584 polas^ voHage-gated cter^ 8.4 
107901 L42612 Hs.335952 kerafin6B 

107901 L42612 Ha335952 kerafin6B \* 

I07922 BE153855 Hs.61460 IgsuperfemBy receptor INK ~ 

I07974 AW956103 Hs.61712 pynwatB dehydrogenase!^ 

I06040 AL121031 Hs.159971 SVWSNF related, mat* assoc^^ 

108230 AA054224 HslS9847 ESTs J* 
108274 AF129535 Ha^72027 F-bc* only protein 5 

108296 N31256 Hs.161623 ESTs f? 

I08496 AA083089 Hs.339659 ESTs ** 

108607 BE300380 Hs£9476 Homo sapiens cONA RJ12758 fts, done NT2RP2001328 3.4 

108621 AA101809 Hs.182685 ESTs ]* 

108634 AVWJ22410 Hs£9507 ESTs 11 

108647 BE546947 Hs.44276 homeoboxCIO ™J 
108695 ABO2900O Hs70823 WAA1077 protein 

108717 AA122393 HsJ0811 hypo9ttficaIproWnFU20516 « 

(08740 AI089575 Hs3071 progesterone membrane btodhg protein *■* 

108828 AKD01693 H&273344 DKFZP564O0463 protebt ]•? 

108859 AL121500 Hs.178904 ESTs V* 

108872 H06720 Hs.111680 endosuffine alpha ^ 



5.6 
1.6 
&2 



108891 AI801235 Hs.48480 ESTs 

108894 AKD01431 Ha5105 hypoftefcal protein FU10569 ™ 
I08955 AA149754 Hs.195155 HorrosapleittamhOBctitrarBport 
108982 AA151708 Hs.171980 homeo box (expressed In ES cells) 1 
I08987 AA152178 H&23467 hypoftefical protein FU10633 

IQ9G02 AB028987 Hs.72134 K1AA1Q64 protein *J 

I09026 M157811 HS ' 72127 gtaoW^ 53 

109068 AA164293 Hs.72545 ESTs ~ 

109101 AW608930 H&52184 rrypothefcal protein FU20618 J* 

109112 AW419196 Hs257924 hypoftefcalprotefr FU13782 *f 

1 09124 AKD00684 Hs.183887 hypoflieficaJ protein FU22104 1*1 

109139 AJ132592 HSw59757 zir^Snger protein 281 « 

109166 AA219691 Hs.73625 RAB6 Interact Mnesln^(rabtt»esh 6) « 

109198 BE566742 Hs£8169 Ngjily expressed In cancer, riAtofeidneheptad repeals «| 

109213 NM_016603Hsa2Q35 potential miclearpnMi(»OI« W*protti « 

103220 AW95B181 Hs.189998 ESTs *' 

I09233 AU077281 Hs.170285 nucteoporin 214kD (CAIN) ^^ 1IU . , " 

I09Z70 N99673 Hs3585 ESTs, WeaWy sfertBar to AF126743 1 DNAJ dorrairwxxitaWng protein MCJ [H^aptensJ 1.4 

109273 AA375752 HsJ2719 Horrosapter6mRr^c»UDKFZp586F1^ « 
109313 AF153201 Hs36276 C2H2(Knippetype)ztoc^pn*eh 

108341 AA213506 Hs.115099 EST ^ f? 

109391 ALD96858 Hs.184245 K1AA0929 r^rotein interacfing rnx^ J* 
109420 W3603 Hs.40408 homeo box C9 

103426 N3Q531 Hs.42215 prote4f>phosptetase1, regulatory subunft 6 *JJ 

I09429 AH60Q29 Hs*1438 ESTs \* 

I09445 AA232103 Hs.189915 ESTs ]* 

109450 AB032969 Hs.173042 WAA1143 protein 

I09468 NM_015310Hs.6763 KIAAQ942 protein 

I09478 AW074143 K&87134 ESTs 

[09570 L40027 Hs.1 18890 glycogen synthase kinase 3 alpha 

I09662 FQ2614 Hsl27319 ESTs \\ 

I09825 R71264 Hs.16798 ESTs " 

H0039 H11938 H&21907 rtisiore acety tasferase f*r 

110056 AA503041 H&279009 matte Gb protein J~ 

[10085 AA603840 Hs^9956 WAA046Q protein ^ 

IKS Kt29 i^WeaWysin^^ £ 

IS SSKSSb HSE^^ « 

110242 N41744 Hs.19978 CGW0 protein „ li 

10259 H28428 fe32406 ESTs, Weakly sfrnflarto 138022 hypothetical protein [Rsapfens] 22 

10312 BE256986 Hs.11896 hypoftefcal protein FU12089 Z1 

119 



WO 02/059377 



PCT/US02/02242 



110330 AI288666 Hs.16621 DKFZP434I1 16 protein 6.2 

1109)1 H55748 gb?q94a01.s1 Soaesfey Irversple^ 6.1 

110504 H55915 Hs210859 hypothetical protein RJ11016 6.1 

110S5 H57330 Hs37430 EST 63 

5 110568 AKD01160 Hs5999 hypothefcaJ protein RJ10298 13 

110699 T97586 Hs.18090 ESTs 1* 

110705 AB007902 Hs.32168 K1AA0442 protein 1.6 

110742 AW190338 Hs^8029 hypoftieticalpr(MiMGC11256 7.6 

110761 AL138077 Hs.16157 hypofteBcal proteh FU12707 2.5 

10 110762 BHM4245 Hs30011 hypothec pnrtelnMGC2983 93 

110765 AKD00322 Hs.18457 hyrx)foe8calprotetoFU2Q315 53 

110769 BE000831 Hi23837 Homosapmd)NABJ11B12^clorttHBABA100^ 2.1 

110799 A1089660 Hs323401 apy-3W9© protein 13 

110805 T25829 Hsl24048 FK506hiridirigproteh precursor 6.6 

15 110813 AA767373 H&35669 ESTs, Ktoderately sknSar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 5.7 

110820 R33261 Hs.6614 ESTs, WeaHy simBar to A43932 rnucin 2 precursor, htesfinal [H^apfens] 3.4 

110840 N315S8 H&1Z727 hypofeScal protein FU21610 1.7 

110844 AI740792 Hs.167531 memykxotoroyKtoenzy!^ 1.7 

110854 BE612992 H&27931 hypoftefcalprotehllirl^ 4.7 

20 110856 AA992380 #j^rj06s1 SoaresjB^^ 23 

110885 BE384447 Hs.16034 hypoftefca! protein MGC13186 33 

110897 AL1 17430 Hs.6880- DKFZP434D156 protein 22 

110915 BE092285 H&29724 hypcrthefical proteto FU13187 2.6 * 

110918 H04360 HsJ24283 ESTs, Mcderater/slrnBar to reduce 1.9 

25 110958 NM.005864H&24587 sqgrmJ transduction protein (SH3 contakiing) 6.7 

110963 AKD02180 Hs.11449 DKFZP5640123 protein 2.0 

110981 AK001980 Hs24284 ADP-ribosyttransferase (NAD+; pory(ADP-ribose) poJymerase}-tDce2 13 

110984 AW613287 Hs30120 UTP-Nwrytefeha^ 13 

111125 N63823 H&269115 ESTs, Moderately similar to Z195_HUMAN ZINC FINGER PROTEIN 195 [Rsapiens] 3.6 

30 111132 AB037807 Hs33293 hypofte&al proteh 2.1 

111164 N46180 Hs.122489 Homo sapiens cDNAFU13289tB,cbneOVARC100l170 23 

111172 R67419 H&21851 Homo sapiens cDNA FU12900 fis, done NT2RP2004321 37 

111174 AL050166 Ha26295 Hon» sapiens inRNA; ri)NA 7.5 

111179 AKD00136 Hs.10760 asporin (LRR dass 1) 7.1 

35 111184 AM5486 Hs243901 Horno sapiens cDN*FU2C738 fis, done HB»0a257 6.7 

111184 A1815486 H&243901 Horno sapiens cDNAFU20738fe, clone HE 33 

111189 N67603 Ha272130 ESTs. Wealaysimaar to S65824 reverse transolptehomotogiH^apiens] 33 

111216 AW139408 Hs.152940 ESTs 1-5 

111221 AB037782 Hs.15119 K1AA1361 protein 2.6 

40 111223 AA852773 Hs334838 WAA1866 protein 4.6 

111239 N90956 Hs.17230 rrypofte&alprotetoFU22087 73 

111285 AA778711 Hs.4310 eukaryofctrardationhaiafion factor 1 A 63 

111299 ABQ33091 Hs74313 KIAA1265 protein 5.0 

111312 AK23913 Hs34504 ESTs 33 

45 111318 T99755 Hs334728 ESTs 12 

111337 AA837396 H&263925 USI-interacfing proton NUDE1, rat homolog 5.1 

111352 H58589 Hs35156 Homo saptens cDNA FU11027 8s, cfone PIACE10041 14 22 

111370 A1478658 Hs34631 brefekfinA4rtfiibiteda^^ 23 

111384 N94606 H&288969 HSCARG proteh 22 

50 111389 AK000987 Hs.169111 oxidation resistance 1 2.1 

111391 NAL003896Hs225939 sia»yttrartsferase9(C^-NeuAc^^ 5.1 

111392 W46342 Hs325081 Homo sapiens, dona IMAGE3659680. mRNA, paroalcds 8.4 
111452 R02354 Hs.15999 ESTs 2.7 
111466 AI051194 Hs227978 EST 63 

55 111549 W90638 H&20321 ESTs. Moderately simBar to ZRF1.HUMAN ZUOT1N RELATED FACTOR-1 (M-PHASE 1.4 

111585 R10720 Hs3D670 EST 13 

111627 R52656 Hs21691 ESTs 13 

111870 AB037834 Hs.18685 r^sapiererriRNA for WM14 13 protein, p 2A 

111937 BE298665 Hs.14846 Homo sapiens mRNA; cDNA DKFZp554D016 (from clone DKF7p564D016) 103 

60 111944 AW083791 H&21263 suppressor of potest transport defect 3 6.6 

111987 NM_O15310Hs3763 KIAA0942 potest 5.1 

112134 R41823 Hs.7413 ESTs; cabyntenin-2 A 

112244 AB029000 H&70823 WAA1077 proteh 14.6 

112388 R46071 Hs301693 Homo sapiens, done IMAGE3638994, mRNA, partial cds 9.0 

65 1 12456 NMJ016248H&232076 A taase (PRKA) anchor protein 11 14 

112464 AW0072B7 K&2B538 Homo sapiens cDNA:FU21086 8s, done CAS03272 1.4 

112506 AI742756 Hs26079 ESTs 3.2 
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112513 R68425 Hs.13809 hypo8>efeal proteto RJ10B48 2.0 

112752 AKD01635 Hs.14838 hypotfietteal protem RJ10773 15 

112884 AKD00004 Ks3013 Homo sapferomRNA for FU00004proteh,parfialc<b 6.6 

112923 T1Q258 Hs5057 EST 15 

5 112938 AW970826 Hs5185 K1AA1557 proteh 35 

112958 R61388 Hs.6724 ESTs 6.0 

112966 Z44718 Hs.102548 gfoc«»rta)id receptor DM 6.4 

112978 AKD0Q272 Hs.7099 hypotoefcal protein FU20265 15 

112995 AA737033 Hs.7155 ESTs, Moderate^ simBar to 2115357A TYW protefolMjiuscutus] 5.6 

10 112996 BE276112 Hs.7165 2mc finger protein 259 2.0 

113047 AI571940 Hs.7549 ESTs 19 

113049 AWS65190 Hs.7560 Homo sapiens mRNA for WAA1729 protein, parfial cds 2.4 

113069 T40707 Hs570862 ESTs 15 
113196 T57317 giryb51a03.s1 Stratagene fetal spteen (937205) Homo sapiens cDNA done IMAGE.74668 3\ 1.7 

15 113248 T63857 gteyc16e01.s1 Stratagene lung (937210) Homo sapiens cDNA done 3\ mRNA sequence 2.8 

113254 AKD02180 Ha.11449 DKFZP5640123 protein 15 

113277 AW971049 Hs.11774 proteh (pepWyH)roJyl ds^to 35 

113429 AA688Q21 Hs.179808 ESTs 1-2 

113499 AI467908 Hs.8882 ESTs 55 

20 113547 H59588 Hs.15233 ESTs 2.0 

113554 AW503990 Hs.142442 HP1-BP74 3.6 

113647 AA813887 Hs.188173 Homo sapiens cDNA FU12187 fe, done MAMMA1000831 15 
113702 T97307 gfcye53hQ5.s1 Soares fefed Brer spleen 1NRS Homo sapiens cDNA done 1MAG&121497 3", 4.4 

113722 AV653556 Hs.184411 albumh 15 

25 113759 AW499665 Hs.9456 SWt/SNF related, niatrbcasscd 15 

113777 BE266947 Hs.10590 zinc finger proteto 313 134 

113783 AL359588 Hs.7041 hypometol protein DKFZp762B226 1.7 

113791 A1269096 Hs.135578 diaabiase,cW4^*eiyl- 15 

113808 W44735 H&9286 HcrTOsap^cDf4A:aJ21278^c^C(X01832 .35 

30 113811 BE207480 Hs£994 HcrnosapterecDNA:FU22044fis.cl^ 3.1 

113817 H13325 Hs532795 hypotieficai protein DKF2p761017121 35 

113826 AW378212 Hs54809 hypomeflcal protein HJ10826 23 

113834 T26483 Ha5059 H3F«»ntalnhg fiwiWB© eodraceiWarmatto protein 2 115 

113868 W57902 KsS0744 proteasome(prosar^nBcrc)pain)26Ssubun^ 2.7 

35 113870 AL073314 Hs.16537 hypothefcal protefeusbnter to (U06944) PRAJA1 6.1 

113885 AW959486 Hs51732 ESTs 65 

113923 AW953484 Hs3849 hypofte&al proteki FU22041 sinto to FK506bi«fcq proteins 15 

113989 W87544 Hs568828 ESTs 15 

114022 A1539519 Hs.120969 Hcrnosapierisd)NAFU11562n^doreHEMaA1003197 5.4 

40 114030 AI82S388 Hs.164478 hypofceMprc*BtoFLI21939sirto^ 9.4 

114060 AB029551 Hs.7910 RIN61 and YY1 btwfing protein 15 

114195 AF017445 Hs.150926 fuca^l-phcsphateguarytv^^ 15 

114226 AB028968 Hs.7989 WAA1045 protefc 15 ^ 
114253 BE149866 Hs.14831 Horno sapiens, Sto to zfrx: finger protein 136 (done pHZ-20), done MGC:10647, mRNA. complete cds 25 

45 114262 AL117518 Hs5686 WAA0978 proteh 14 

114275 AW515443 Hs506117 WAAQ306proteh 155 

114292 A1815395 Hs.184641 fatfraddc*sataase2 15 

114309 AA332453 Hs50824 CGi-85 protein 24 

114392 AA249590 Hs.100748 ESTs,WeatysteiBartoA28996pic«nwlchpr^ 15 

50 114407 BE539976 Hs.103305 Homo sapiens mRNA; cDNA DKFZp434B0425 {from done DKFZp434B0425) 15 

114455 H37908 Hs571616 ESTs, Weakly simfar to ALU8_HUMAN ALU SUBFAMILY SX SEQUENCE 55 

114463 AL120547 Hs.40109 WAA0872 protein 52 

114464 AI091713 Hs.106597 Hcrro sapiens, Sinto to RIKENcDM 

114471 AAO28074 Hs.104613 RP42homotog 15 
55 114480 BE066778 Hs.151678 WMtoafyWhfre^^ 134 

114671 AA766268 Hs568273 hypofteflcat protein FU13346 15 

114698 AA476966 Hs.1 10857 pftmeraso (RNA) Pi pNA |d tecteolpdypeptJ(teK(125kDa) 3.5 

114730 A1373544 K&331328 HermediaiB fiamenl protein synooffln 35 

114767 AI859865 Hs.154443 iii lr ifchaiiusoroe maintenance deficient (S. cerevfciae) 4 15 

60 114774 AV656017 Hs.184325 CGI-76 proteh 3.1 

114798 AA159181 Hs54900 seiotogjcaiiy o^fhed coton cancer airmen 1 35 

114860 AL157545 Hs.42179 brwixxtomain and PHD finger containing, 3 45 

114895 AA236177 Hs.76591 WAA0867 protein 7.1 

114896 BE539101 Hs.5324 hypome&al proteh 15 

65 114911 AA236672 gfczt29f02s1 Soares ovary tumor NbHOT Homo sapiens cDNA clone IMAGE:723771 3*. mRNA sequence. 1.5 

114930 AA237022 Hs.188717 ESTs 25 

114938 AA242834 Hs£8384 ESTs 25 
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14965 AI733881 Hs.72472 BMP-R1B \\ 

I5023 AF102546 Hs£3931 dachshund prosophte) homolog ]~ 

115038 AA252360 Hs^7968 toW&e receptor 9 !f ft 

115061 AI751438 Hs.41271 Homo sapiens mRNA fuH length Insert cDNA clone EURCHMAGE 1913076 

115062 AA253314 Hs.154103 UM protein (dmliar to rat protein Hna ]■? 
'15117 AIS70847 H&5324 hypoftetfcal protti J~ 

15121 M634549 H&8B155 ESTs J* 

15206 AW1 83695 Hs.186572 ESTs f? 

I5221 AW365434 Hs.79741 hypo1heticalpi0tefiHJ1O116 ]■? 

115239 BE251328 Hs.73291 hypo&id)calprotBlnFU10881 ,_ t ^ " 

115242 AB68236 Hs383732 ESTs, Materateiy sirralarto ALU1JJUMAN ALU SUBFAMILY J SEQUENCE 14 

115278 AKD02163 H&301724 hypcrfnefcal protein RJ11301 J* 
115285 AW972872 Hs^93736 ESTs 

15291 BE545072 Hs.122579 hypofreta! protein RJ10461 

115400 AI215069 Hs.89113 ESTs ™ 
15468 AA314349 Hs.48499 tumor anfigen OP-Bp 

15471 AK001376 Hsj58346 hypafoefca) protein FU10514 ]* 

115479 AW301608 Hs778188 ESTs, Moderately sorter to 154374 gene NF2 protein frtsaptens] 4.0 

115496 AW247593 Hs.71819 adtaiyofic transfafion Miaficfl factor 4EUnc^protfib1 ™ J 

115500 Y14443 Hs.88219 zinc finger protein 200 jjj 

115553 A&75986 Hs.71414 transcripfion factor (SMIF gene) « 

115581 A540842 Hs.61082 ESTs *i 

115587 BE081342 Hi283037 HSPC039 protein rr 

115590 AA399477 Hs£7896 7-60protem J? 
115646 N36110 HsJ05971 sotote carrier fanty 2 (larjB^ 

115652 BE093589 Hs-38178 typc^tkal protein RJ23468 J« 
115655 AL048269 H&288544 Homo sapiens, done MGC:18)63, mRNA, complete cds 

115663 AI138785 Hs.40507 ESTs £° 
115676 AA953006 Hs*8143 ESTs 

115690 AA625132 Hs.44159 hypcflwBr^protetoFU21615 u _ ^ J* 

115693 AF231Q23 Hs£5173 cadhenHEGFUGsoveniJassG^receptw J* 

115715 BE395161 Hs.1390 pwteasare(prDscrne,n ™ 

15734 AI950339 Hs.40782 ESTs fr . 
115811 KMJM5434Ha48604 DKFZP434B168 protein 

115823 AI732742 Hs*7440 ESTs f\ 

115837 AI675217 Hs.42761 ESTs ]~ 

115844 AI373062 H&332938 hypothelicaJpro1etoMGC5370 4.4 

115866 AW062629 H&52081 KJAA0867 fKOteto J* 

[5875 N55669 H&333823 ntahondrial rftosorna! protein L13 J* 

15941 AI867451 Hs.46679 hypoftetical protein FU20739 *5 

,15968 AB037753 Hs£Z767 KIAA1332 proteh J* 

116003 BE275469 H&66493 Down syndrome critical legiongeneS \\ 

116011 AL359053 Hs57664 Homo sapiens mRNA fafltengfo insert cDNA done EUROIMAGE 2005735 2.4 

116108 AA770688 Hsl28777 H2Arrfstoreferrfy,nrernberL \\ 

16134 BE243834 H&50441 CGMM protein \\ 

H6189 N35719 Hs.44749 ESTs,Mc>tete*sirifctfto^ J-J 

16195 AW821113 Hs.72402 ESTs fl 

16238 AV660717 Hs.47144 DKFZP586N0819 protein ] l 

,16246 AF265555 Hs250646 bacrtwkaJ IAP lepeatcontahtog 6 ]•» 

116262 AI936442 H&59838 hypothec protein FLH0808 y 

I6298 AI955411 HsS4109 Homo sapiens cONAFU13634fe,cton8PlACE101 1133 I* 

16318 AF097645 H&58570 (tetetedtocarK»1;F^heficaseW3B/DICE1 43 

I6325 AJ472108 Hs.49303 Hofr»sapierecOMFlJ11663^ck^ 1.4 
116336 AL133033 Hs.4084 K1AA1Q25 protean 

116339 AKD0Q290 Hs.44033 dlpepfldytpepfldaseS 1 ^ 
J16350 AA497129 Hs.184771 nucteartactorl/C(CX^T^^traiisa^ 

H6358 A1149586 H&38125 toterferav^nAiced proteh 75, 52© 1-9 

16365 N50174 Hs.46765 ESTs ™ 

,16368 N90466 Ks.71109 KIAA1229 protein ^ 

J16417 AW499664 Hs.12484 Human clone 23826 mRNA sequence 7.4 

116436 AA161411 H&58668 duomosorrie 21 open reac^ frame 57 2.J 

116462 AF218313 H&236828 putative heHcase RUVBL \* 

116470 AI272141 Hs33484 SRY (sex deterrrMig region YHwx 4 <-\ 

H6470 AQ72141 Hs.83484 SW {sexdetermkraigregkmYHwx4 ]-2 

116575 AA312572 Hs^241 ptosprwinosffioW^^^ ^ 

116637 AKD01043 Hs.92033 frrtegnWirtad kfrtfise^ ^ 

Hs^11563 B^OUrymphorna7A 23 
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116700 

116705 

116732 

116921 

116926 

117034 

117132 

117247 

117276 

117284 

117367 

117368 

117382 

117412 

117557 

117588 

117745 

117754 

117879 

117881 

117904 

117911 

117933 

117983 

118078 

118301 

118429 

118472 

118488 

118509 

118528 

118656 

118670 

118698 

118737 

118925 

118984 

118986 

119206 

119235 

119235 

119265 

119279 

119238 

119338 

119349 

119403 

119478 

119486 

119513 

119601 

119602 

119676 

119682 

119774 

119780 

119789 

119805 

119818 

119863 

119905 

119966 

120132 

120206 

120248 

120253 

120269 



A180Q2Q2 H&317589 hypofoetfcal protein MGC10765 
AW074819 Hs.12313 hypo&efcal prate* FU14566 

AW152225 Hs.165909 ESTs, Weakly slmflar to 138022 hypothefcal protein [Usapfens) 
AW068115 Hs.821 bfetycan 
H73608 H&290830 ESTs 

YY1-associated factor 2 
p10-bWbg protefo 



Hs.180324 
Ks.42315 



U72209 
AJ393666 
N21032 
N71183 

AKD01701 Hs.183779 
AKM1793 Hs,425Q2 
AJ878942 H&90336 
AF150275 Hs.40173 
N32536 Hs.42645 
AF123050 Hs.44532 
N34895 Hs.44648 
B8294925 Hs.46680 
AA121673 Hs59757 
N54706 Hs303Q25 
AF161470 H&260622 
BE540675 HsJB2938 
AL137379 Hs.47125 
Y10518 Hs.116470 
AL11Q246 Hs.47367 
N54321 Hs.47790 
AA453902 H&293264 
AA243332 Hs74649 
AL157545 Hs.42179 
AJ277275 H&501Q2 
1422617 Hs.43228 
AI949952 Hs.49397 
A1458020 H&293287 
AA332845 Hs.152618 
AB033113 Hs50187 
AA199686 

N92293 Hs.206832 
AI668709 H&240722 
AF148713 Hs.125830 
W24781 H&293798 
AW453069 HsJ657 
AW453069 H&3G57 
BE539706 H&285363 
N57568 Hs.48028 
NMJD01241H3.155478 
AM17240 H&320836 
765004 Hs.163561 
All 17554 Hs.119908 
AJ624342 Hs.170042 
AI796730 H&55513 
W37933 

AKD00155 H3J91684 
AW675298 Hs233694 
AA243837 H&57787 
W61019 Hs57811 
AB032977 K&6298 
NM_016625Hs.191381 
BE393948 H&50915 
AJ223810 Hs.43213 
AA130970 HS58382 
AA081218 K&58608 
AW449064 Hs.119571 
AA703129 Ks56963 
W57554 Hs.125019 
H26735 HS31668 
AI924294 Hs.173259 
AA131376 Hs.326401 
AW131940 Hs.104030 



1.4 
3.4 
24 
8.3 
1.7 
3.4 
5.2 



gbcyx46©63l Scares melar»cyte2Nb^ 5 -5 



Hs.121805 Homo sapiens cDNA FU11971 ffe, done HEMB810Q12G8 
Hoiid sapiens CDNARJ10590 ft* c^ 
ESTs 

ATPase. H+ transporting, lysosomal (vacuolar proton pump), member J 
ESTs 

solute canter family 16 (monocarbaxyfc add transporters), member 6 



ESTs 

CGM2 protein 
zinc finger protein 281 
chromosome 11 open reading trame24 
butyrate-hduced transalpt 1 
hypc*ietka!piotelnMGC5370 
rypoftefcal protein RJ13912 
hypothetical proteki FU20048 
WAA1785 proteh 
EST 
ESTs 

cytochrome c oxidase suburut Vic 
bromcdomatn and PHD fmgerccrrfalmng, 3 
rapa-2 (rapa gene) 

Homo sapiens cDNA FU11835 fts, done HEMBA1008595 
ESTs 
ESTs 

ESTs, Moderatety similar to ZN91_HUMAN 7JH0 FINGER PROTEIN 91 {Rsaptensl 
WAA1287protBh 

gb2O75a09j1 Stratagem* hNT neuron (937233) HcmmsapIei»cDNAdcTOlMAGE647488ff 
ESTs. Moderately staflarto ALUB.HUMAN ALU SUBFAMILY SX &QUENCE CONTAMINATION 1.4 
ESTs,Mod8ra^sirnBartoALU8jnJMANALU SX SEQUENCE CONTAMINATION 3.6 



15 
2.0 
2.0 
2.1 
2.1 
1.4 
3.4 
3.4 
3.0 
1.9 
15 
5.7 
5.9 
1.7 
1.7 
5.4 
5.2 
2.6 
25 
4.1 
\2 
15 
7.4 
25 
1.2 
2.1 
5.2 



bladder cancer overexpressed protein 
WAA1710pro!efo 

ac^y-dependenlnemip(Dlec^prc^ 
adMty^epertdent r*uroprotedhreprote 
ESTs 
EST 

cycSnT2 

ESTs, Weakly sMar to A47582 B-cefl gn>^ factor piecuisor [Hjapiens J 
ESTs 

nucteobr protein NOP5/N0P58 

ESTs 

ESTs 

Empirically selected from AFFX smgteprobeset 

Hon»sa|^ntf^cDNADKFZp66^ 

hypothetical protein RJ11350 

ESTs 

ESTs 

KIAA1151proteh 

hypothetical proles) 

taikieln 5 (KU<5; KUU2; strata 

ESTs, WeaWy skrto to EFSJWMAN TRANSFORMATION-SENSITIVE PROTEIN IEF SSP 

hypothetical protein FU11101 

Homo sapiens cDNA FU 14208 fe, done NT2RP3003157 

cofegeri, type TO, alpha 1(Bi^^ 

ESTs 

tyrnphotd nuclear protein (LAF4)mRNA 
Homo sapiens done PP1498 unknown mRNA 
uncharacteri^bcmmanowpro 
fibroblast growm factor 12B 
ESTs 

123 
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1.7 

22 

1.6 
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25.1 

1.6 

1.3 
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120274 AA177051 gb:nc02a02j;1 NCLCGAPJ>r3 Homo sapiens cDNA done )MAGE:194 slm3ar to contains Mi 4.6 

1 20280 AA190577 gb2p52g0Zs1 Straiagene HeLa cefl s3 937216 Homo sapiens cDNA done 3*. mRNA sequence 2.0 

(20296 AW995911 Hs^9883' hypo^flcal protein FU233^ ] f 

I20297 AA191384 Hs.104072 ESTs, Weakly sbnBarto Z195_HUMAN ZWC FINGER PROTEIN 1£> [H^apfens] 15-2 
[20324 AA195517 Hs.191643 ESTs 

[20325 AA195651 Hs.104106 ESTs ** 
[20327 AKD00292 Hi278732 hypoihe6cal proter RJ20285 

J20336 N85785 Hs.181165 eukaryofctrareta^etoi^ 2 - 9 
120342 AW450669 Hs.45068 hypchefical protein DKFZp434l143 

120345 AA210722 Hs.104158 ESTs *f 

[20349 AW969481 Hs55189 hypofteOcalproteh ! B n 8 

J20352 R06859 Hs.193172 ESTs, Weakly sfrnfar to 138022 hypothetical protein [H.sap)ens] 5.0 

120356 AF000545 Hs296433 piitatopumerojc receptor 28-J 

120371 AA219305 Hs.104196 EST ™ 

[20382 AA228Q26 Hsl38774 ESTs ™ 

[20383 AL109983 Hs.125122 FSHpilrnafy response (liVRI. rat) hccx^ a/ 
120386 AW969665 Hs.154848 hypotieScal protein DKFZp434D0127 

S AV^^ l^S ml ModeraleV sinto to ALU7JU^ ALU SUBFAMILY SQ SEQUENCE CONTAMINATION 21.7 

120396 AA134006 Hs.79306 eukar^traretafionWrJation factor 4E 12*5 
120404 AB023230 Ha96427 WM1013 protesi 

120418 AW966893 Hs26813 Homo sapiens mRNA; cDNA DKFZp586F1323 (from done DKFZp586F1323) 11.4 

120423 AA236453 Hs.18978 Homo sapiens cDNA:FU22822fc, done KAIA3968 1-9 

120472 A1950087 p^*tf)5c02xl NCI_CGAPj<Jd12 Homo sapiens cDNA done 2, mRNA sequence 194 

120473 AA251973 Hs269988 ESTs 5jf 
12)484 AA253170 HSJ98473 EST 

1 20504 AA256837 o£zr84d10.s1 Scares _NhHMPu_S1 Homo sapiens cDNA done IMAGE£82387 3", mRNA sequence, 3.9 

120509 BE047718 H&96545 ESTs 9 ' A . 

J20520 AA258601 Hs.161731 EST JJ 

I20535 BE35Q244 Hfc96547 ESTs , ^ « 

[20551 AA279160 Hs.1 11407 Homo sapiens, done IMAGE3613029, mRNA, parte] cd3 52 
120570 AA280679 Hs271445 ESTs. Weakly skn2ar to ALULKUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 14.4 

1 2682 BE244830 H&284228 ZNF13&Ae proteh J 0 - 2 

I20590 AW372799 Hs.125790 Jeodhe-cteh repeakontaming 2 2-1 

120596 AA282074 H&237323 N^cetylgkrosarrin^^ 7.5 

120619 AW965339 Hs.111471 ESTs £5 
120624 AW407987 Hs.173518 Mphasa rtosphoprotefn homotog 
120539 AA286942 gfca5fil*s1 NCLOWjra 

120648 AA287095 Hs.140309 Homosapiens, done !MAGE3b77194,mRNA,r^crJs 55 

I20653 AW063659 Hs.191649 ESTs ' « 

120668 AW969638 Hs.1 12318 6.2kdproteh 22 

I2TJ669 BB36739 Hs.1(BS)9 ESTs ]* 
[20695 AA976503 g*»q3Qa04.s1 NQ_CGAP_GC4 Homo sapiens cDNA done 3* similar to contains PTR7i1 PTR7 46.8 

1206% A1821539 Hs57249 ESTs / ^ 

120713 AW449855 H&B6557 HcTOsar>ensd)r^RJ12727rls.doneNTaV20(^ i 55 

120718 AA292747 Hs57296 ESTs t 25 

! 20750 AI191410 Hs56693 ESTs,Mortarateryslrr^ 7.0 

20774 AI608909 Hs.193985 ESTs ™ 

I20807 AA346385 H&300Q2 SH3~cordaWng protein SH3GLB2; WAA1848 protein 65 

i 20809 AA346495 gb£ST52657 Fetal heart nHcrrosapterec 4.4 

I20938 AA386260 Hs.104632 EST \\ 

[20977 AA398155 Hs57600 ESTs \ A 

[20984 BE262951 Hs59052 ESTs 5.6 

120985 AJ219896 Hs57992 ESTs J* 

121011 AA398360 Hs57608 EST *J 

1 21 026 AM39713 Hs.165295 ESTs 3 ^ 

[21081 AA398721 Hs.186749 ESTs, HkjWy similar to 137550 mismatch repair protein MSH2 (H sapiens] 5.4 

121133 AA363307 rb57032 ESTs \\ 
121176 AL121523 Hs57774 ESTs ™ 

121223 AI0Q2110 Hs57169 ESTs, WeaHy simBaj to dJ667H122.1 [H^apfens] 22 
[21M0 AA4O3008 Hs50l927 C6.1A 1^ 

[21340 AW956981 H&97910 Homo sapiens cDNA FU13383 Bs, done PLACE1CS)1024 33 

121408 AA406137 Hs58019 EST nl . , ^ 

121439 AA410190 H&98076 ESTs, Weakty slmter to A47582 B-ceB growft factor precursor (H^apiens] 7.4 

121450 AA406430 Hs.105362 Homo sapiens, dons MGC:18257, mRNA, comptetecds 65 
j 21452 AW971063 Hs292882 ESTs 

121455 H58^6 Hs.15165 refirofcaddlrtfuced14 103 
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121457 W07404 Hs.144502 hypotfteOca) protein FU22055 3.4 

121496 AM42224 Hs.97900 ESTs 14-4 

121505 AA494172 Hs.194417 ESTs 13.1 

121508 AM02515 HaS7887 ESTs 28.0 

121513 AA416653 Hs.181510 ESTs &2 

121514 AA412112 gtezt68bOZs1 ScaresJesfeJJHT Homo sapiens cDMA done !MAGE:727563 3\ mRNA sequence. 2.6 
121549 AM12477 Hs.98142 EST t 7.4 

1 21 558 AA412497 gbztKg12^1 Soares_tesfls _NHT Homo sapiens cDNA done IMAGE:730150 J similar to contalns23 

121577 AM11970 HsS8098 EST 15 

121581 AA416568 gbzuQ5dQ.s1 SoaresJesfe_NHT Homo sapiens cDNA done 3\ mRNA sequence 6.1 

121589 AD001528 H&89718 spenritae synftase 33 

121594 AA626010 H&38247 ESTs ^ 

121622 AM16931 Hs.126065 ESTs ^ 

121655 AA421537 Hs.178072 Homo sapiens mRNA; cDNA DKFZp434B1023 {from done DKFZp43481023) 73 

121682 AA418160 H&66043 Homo sapiens cDNA FU13558 8s, done PLACE1D07743 2.0 

121690 AV660305 Hs.110286 ESTs *-7 

I21706 U55184 Hs.154145 hypdhetkal protein RJ115K 12.7 

121714 AA419225 K&98269 Homo sapiens cDNA FU11953 fis, done hEM88100Q883 8.1 

121729 A1943597 Hs38325 ESTs 18 

1 21731 AM21041 Hs.180744 ESTs 4.0 

121744 AA398784 Hs37514 ESTs 7-1 

[21748 BB36911 H&234545 hypoftefica! protein NUF2R 193 

121773 AB033022 Hs.156654 WAA1196 protein 73 

121775 AA421773 Hs.161006 ESTs 17 

[21776 AA292579 Hs.125133 hypothetical proti^FU22501 6.6 

121786 AB10774 Hs38376 ESTs 1a5 

1 21832 AW340797 Hs38434 ESTs 53 

121836 AA328348 Hs218289 ESTs 33 

121839 AA425691 Hs.191606 ESTs, HfctysimBar to WAA1 048 protein JUsaptensJ 53 

(21842 AFQ27406 Hs.104865 senWmreonlne kinase 23 2.7 

121847 AA446628 H&2799 caiBage fcHng protein 1 23 

121871 AW972668 H&2S3044 ESTs 2 9 

121882 AA426376 Hs.98459 ESTs 50 
121911 AA427950 gkzw50flKLs1 Soaes^trialjBtusJIhZHFB.gw Homo saptens cDNA done IMAGE:773499 3" 7J2 

121915 AA428179 H&223405 ESTs,MoteiatelysWtartoA46010)^^ 23 

121935 AA428647 Hs.98611 EST 23 

J21983 AA298760 Hs.180191 hypoftefical protein FU14904 3.4 

I219B5 AI862570 Hs299214 Hoinosap^ctorelMAGE^822295, rnRH\par^cds MA 

121995 AA210863 H&3532 nemo^to tinasa 33 

121999 AA430211 H&98668 EST 6.4 

J22009 AW292763 Hs.160822 Hc^ sapiens d)h^RJ2)863fe^ dona ADKA01 804 22 
122013 AA431085 Hs38706 ESTs 

[22038 W92142 Hs271963 ESTs, Weakly similar to AUELHUMAN ALU SUBFAMILY SC SEQUENCE (XKTAMINAT10N 13.1 

[22)50 A1453076 Hs.166109 aAV(embiywifcle^ almond 9.1 

J22OG0 AA431738 Hs£8750 EST 1&1 

122114 AW161023 Hs.104921 ESTs 1.5 

122188 AA398838 gbczBOdOtrl SoaiesjBs3sJWT 33 

122204 AA435936 Hs38842 EST 53 

J22246 AA329550 Ha29417 HCfrfaihdlng t ranscrt pg on fectorZhangfet 5.1 

I22257 AA436819 Hs38899 ESTs 53 

I22302 AA441801 Hs.104947 ESTs 5.8 

122341 AW601969 H&S9010 hypothetical protein FU22263sim^ . 23 

122356 AM43794 K&98390 ESTs 73 

122369 AA443985 H&3D3222 ESTs 122 

122371 AA868555 Hs.178222 ESTs 5.0 

122372 AA446006 Hs336677 EST 7.6 
122378 AB032948 Hsl21356 hypofteficaJ protein DKF2p762K2015 23 
122405 AA446572 H&303223 EST 23 
122412 AM46869 Hs.119316 ESTs 73 
122415 AA446918 H&99088 EST 1^ 
122418 AA446966 Ha99090 ESTs, Moderately sim3ar to sfanBar to WAAQ766[risaptens] 63 
I22440 AW505139 Hsj9460 Homo sapiens mRNA; cDNA DKFZp547C244 (from done DKFZp547C244) 23 
122446 AA447603 Hs39123 EST 1-8 
I22448 AA447626 Hs.99127 EST 33 
122458 A1266159 Hs.104980 ESTs 1-5 
122460 AW41S788 Hs39148 ESTs, Weakly similar to S43569 R01H10.6 proton - Caerromabdifis elegans IC.etegans] 9.7 
122464 AA448158 Hs.99152 EST 43 
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122490 AA448349 t&238151 EST *J 

122492 AA448417 Hs.104990 ESTs +1 

122502 AA204969 Hs_234863 Homo sapiens cDNA FU120B2 fis, done HEMBB1002492 1* 
122510 AA449232 Hs39195 ESTs 

122530 AW959741 Hs.40368 adaptor-related protefo complex l.sigma 2 suxma 

122547 AA779725 Hs.164589 ESTs f? 

I22555 AA194Q55 Hs^93858 ESTs J* 

I22570 AA452578 Hs^62S07 ESTs Jr . 

I22572 AA4526D1 Hs.99287 EST , ^ .^^^ I f 

122586 AKD01910 Hs^9303 Homo sapiens cDNA RJ11048 8s, done PUCE1004516 M 

122587 AB040893 Hs£968 WAA1460 protein 

I22596 AKJ28173 Hs.99329 ESTs I ' 

I22599 AL355841 Hs.99330 hypoflwfcal protem FU23588 *■» 

I226Q2 AA411925 Hs^01960 ESTs If, 

122607 AM53518 Hs38023 ESTs ™~ 

[22614 AM53630 Hs39339 EST ™ 

122616 AA453638 Rs.161873 ESTs 

122617 AI681535 Hs.148135 sertne/mreonlno kinase 33 « ,J * 
22618 AA453641 gtaWetfcsl Soares Jesfig J*fT Homo sapiens cDNActoriB3r,mRIW sequence 31.1 
I22622 AA453987 Hs.144802 ESTs ~* 
J2Z717 AM56859 Hs.178358 ESTs °* 
122762 AI376875 Hs.105119 ESTs ™ 

1 22829 AW20453O Hs£9500 ESTs ° ' 

I22834 AA461492 Hs£9545 Homo sapiens cDNAaJ10658^cloriehn2RP2a)6052 3.6 

122836 AM60581 Hs290996 ESTs „ 

122837 AM61509 Hs2935G5 ESTs, Weafy steto to putaihrc p150 [Rsapiens) £ 

122838 AA46G584 Hs334386 ESTs . 
122854 AA600235 Hsw9625 NIMA{rieverlnrrtftDsfea^aHebtedkmase6 
122856 AJ929374 HsJ5367 Src4awdapter 
122861 AA335721 H&119394 ESTs 

122866 BE539656 H&283705 ESTs J2 

122868 AF005216 Hs.115541 Janus tese 2 (a protein tyrosine kinase) « 

122870 AW576312 HU18722 Hon»sap^cDh^RJ21766fedoneC(XF7179 a* 

122872 AW081394 Hs£7103 ESTs " 

122879 AA769410 Hs.128654 ESTs J** 

(22907 AA470O74 Hs.169836 ESTs l 

j 22916 AA470140 Hs529170 EST Y 
122981 AM78951 Hs.105629 ESTs 

123013 AW968324 Hs.17384 ESTs ,_ mn4 ^nn Jr 

[23016 AW338067 Hs323231 Homo saptens cDNA RJ11946 fis. done HEMBB100O709 ^? 

I23Q34 AL359571 Hs.44054 nmeh (GSK38 Interacting protein) " 

123072 A13826CO Hs.104308 ESTs, Weakly simfer to K1AA1395 proton [H^aplensl Wj 

123082 AA485360 H&105QS1 ESTs ** 

I23088 AI343652 Hs.105667 ESTs " 

123110 AA486256 Hs.193510 EST '? 

[23114 BE304942 tt*265848 rrryomegaJh . , , nlj . . fL 

123131 T52D27 rt*271795 ESTs, WeaMy sknQarto I38C22 hypothetical proten (H^aptensj ^ 

123132 AH61582 H&324179 Homo sapiens cDrlAaJ12371fi5 t cIorieMAMMA1lW2434 15-6 
[23136 AW451999 Hs.194024 ESTs £ 
123149 AI734179 Hs.105676 ESTs ™ 

S ESTs, Weakly simBar to RMS1_HUMAN ^^^^J^^^TIP ^wSftP 7^R^5^Xni£rS? nS ^ " 

[23315 AA496369 gfc»37d10.s1 Soares ovary tumor NbHOT Homo saptens cDNA done IMAGE755827 3" similar to4.1 

[23369 AA5C4757 Hs.105738 ESTs ™ 

[23394 AA731404 Hs.105510 ESTs ?? 

I23433 AW450922 Hs.112478 ESTs * ' 

[23466 AA599042 H&112503 EST . ^ . „ l '\ 

123470 AW303285 Hs303632 HurnanDMseojiencsf^dom 

[23471 ABQ21644 Hs.197219 zmc finger protem 14 (KQX 6) ~ 
[23475 BB439553 Hs35052B Homo sapler^ clone 0MGEtfO38694,r^ 

1 23482 N95Q59 Hs£509B ESTs 1* 

123486 BED19072 Hs334802 r^sapterecDHAFU14680fis t d^ 

123508 AW380388 Hs.155546 K1AA1080 protein; Gofcl-assodated, gamma-adapfa ear corfaWrig, W-wno^ protetn l Z2 

[23615 AA609170 gbaf12a12j1 ScaresJestfeJWT Homo sapiens cDKAdoney. mRNA sequence 7.8 
123619 AA602964 9bTC^(fl2a1 NCLCGAPJ^Homosapto 
I^KB AA609364 5b^1dMirt SoaresJeslisJOT 

(23674 AI269609 Hs.105187 kinesln protein 9 gene * J 
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23735 NM.013241HS.95231 

23738 AA609891 Hs. 112777 

23753 AA609955 Hs234951 

23804 AA620464 Hs361915 
123811 AA620586 

23951 AB012922 Hs.173043 

23383 /U272267 Hs.146178 

24001 L42542 Hs.75447 

124008 AM47155 Hs270016 

124070 A1950314 Hs.154762 

24074 H05635 Hs294000 

24178 BE463721 HsS7101 

24203 AA372796 H&268339 

124352 AA640891 Hs.102408 

24375 D87454 Hs.192966 
I24385 AE67847 

24390 AA317338 Hs.7535 

24391 AF155099 Hs.279780 
24417 N34Q59 

24428 H13540 H*82202 

24440 AA532519 Hs.129043 

24466 R10084 Hs. 11 331 9 
24482 N53935 

24498 H79433 Ha268997 

24515 AA669097 Hs.109370 

24608 N71076 Hs.102800 
24631 NM.014053tfe270594 

24634 AI765123 Hs.143671 

24637 AA160474 Hs.75798 

24642 AW968856 KsJZ78569 

24649 N92593 Hs313Q54 

I24656 AW2977Q2 Hs.102915 

124661 R48170 HsJ8436 

24683 AA381661 Hs.119878 

124712 R09166 Ha.191148 

24735 R22952 Hs268685 

24761 AA374756 H&S3560 

[24768 AW368528 Hs.100855 

24775 R41772 Hs.100878 

124777 R41933 Hs.140237 

I24788 R43543 Hs.100912 

124809 AL355722 Hs.106875 

124811 R460S8 H&288912 

124812 R47948 Hs.188732 
124822 AA418160 H&86043 
124825 AA501GG9 H1336693 
124833 AW975868 Hs294100 
24857 R63652 Hs.137190 
24860 R65763 Hs.101477 
24863 AB82555 Ka.127950 
24876 AF135422 Hsl27059 
124878 BE397530 Hs^88057 

124902 H37941 Hs.101883 

124903 AW296713 H&221441 
24930 AHJ76343 Hs.173939 
124942 R99978 Hs268892 
124958 AW78645 Hi431 
24980 T40841 H&98681 
125002 T59338 B&269463 
25047 T79815 Hsl279793 
25051 T79956 Hs.100588 
25056 T81310 Hs.100592 
25101 AI472068 Hs^86236 
25113 T96585 Hsl3Q2270 
25115 T97341 

25125 A1222382 Hs240767 
25147 W38150 



FH1/FH2 domakKxmfaWng protein 
EST 

Hmtingtin rnteracfinQ protein E 

EST, Weak)/ stmBar to S65657 alpha-1 ^adrenergic receptor spBce form 2 [Haptens] 

gb3e60g05^1 Stralagene lung carcinoma 937218 Homo sapiens cONA done IMAGES51 320 3 1 

metastasis-associated 1-fite 1 

choHne dehydrogenase 

ralA Uhding proteh 1 

ESTs 

HIV-1 rev bwfog protein 2 

topotsomerase-felaled function proteki4-2 

putative 6 prc^esvcoupled recc^Iu 

ESTs, WeaHy sfmBarto AF161356 1 KSPC093 fRsaplens) 

ESTs 

K1AA0265 protein 

gfeaq4te10Jrt Stantoy Frontal NBpoof2Homo sapiens dmctor»sim8ar to oontaira 
COBW-Qbb proteh 
NY-REN-18 antigen 
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HiinanDNAsecjuenceftom done 989H1 Ion 
ktoesin heavy chain member 2 
gfcyv5^.s1Soa^tieta)Bvers^ 
ESTs 
ESTs 

ESTs, WeaWysinto to neuronal thread p 
FLVCR protein 

Homo saptens cONA FU13533 fe, done PLACE1006371 
hypotheCca) protein 
sor&tg nexin 17 
ESTs 
ESTs 
EphBI 

ESTs, WeaMy stmSar to M3K9 JiUMAN RflTOGBI-ACnVATED PROTEIN KINASE KINASE 
ESTs 
ESTs 

Homo sapiens mRNA far WAA1771 protefc partial cds 
ESTs 
ESTs 

ESTs, Weakly sbnflar to AUU1 .HUMAN ALU SUBFAMILY J SEQUB4CE 
HomosapfenscONA: FU22726fis.doneHSI15005 
Homo sapiens EST from done 35214, fidl insert 
rrypoffieScal protein FU22604 
ESTs 

Hcmo sapiens CONARJ13558 lis, done PIACE1O07743 , 
ESTs 
ESTs 
ESTs 
EST 

brorTXxbvraOT-contafrtirtg 1 
GOP-nianr«sepyrophosphofytaseA 
hypothetical protein FU22242 
ESTs 
ESTs 

ESTs, Weakly shnBar to ALUBJfUMAN Dll ALU CLASS B WARNING ENTRY ID [Ksaplens] 
ESTs, Moderately sinte to B340B7r^ 
murine leukemia viral (hmMJcwgenerKradog 
ESTs 

ESTs. WeaHy similar to ALUIJiUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 
ESTs 
EST 
ESTs 

K1AA1856 protein 

ESTs, WeaWy stmflarto AIJUFJiUMAN DM ALU CLASS F WARNING ENTRY Hi [Rsapiensl 
gteye57e05.s1 Scares feta] liver spieen 1NFLS Hcmo sapJerecONA done IMAGE.^ 
Human DNAseo^ence from ctoneRP1-12G14cfl 

Enpintally select from AFFXsi^ 1.7 
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125161 W44657 Hs.144232 EST t % ]™ 

25249 AA630863 Hs.131375 ESTs, Moderately similar to ALUB_HUMAN IUI ALU CLASS B WARNING ENTRY 111 [H^pfens] 13 

25255 AF098162 Hs.118631 timeless (Droa)ph2a) horrwlog ?-J 

125279 AW401809 Hs.4779 K1AA1 150 protein J* 

125280 AI123705 Hs.106932 ESTs ?.0 
25298 AW972542 Hs789008 Homo sapiens cDNA: FU21814 8s, done HEP01068 1-5 
I25660 AW292171 Hs^3978 scafibWatechrnerrtfectorB 5 9 
25827 NM.003403HS.97496 YY1 transcription factor 12 
25891 U29589 Hs.7138 choBnergte receptor, muscarhte 3 6.4 
26005 AW409701 Hs.1578 bacutortal IAP lepeatcontahmg 5 (survMn) 143 
26202 AA157632 H&272630 vacuolar proton pump ^polypeptide 2.4 
26695 AA643322 Hs.172028 a olshtegrfo arri metatopro^ 91 
27050 AW411066 H&274351 CGW9 protein 1™ 
27274 AW968158 Hs38582 Homo sapiens cDNA FU12789 8s. done KT2RP20O1947 123 
28355 AW293012 Hs.161623 ESTs ™ 
28493 D874G6 Hs^40112 WAAD276 protein 3.1 
28493 D87466 H&24Q112 KIAA0276 protein 1-3 
28522 BE173977 Hs.10098 putaftre nucleolar RNAhefcase 94 

28527 AA5045B3 Hs.101047 trarecrPonfcKtor3(E2AfoHrni^^ 1-5 

28528 R39234 H&251699 ESTs, WeaMy stmBar to DN4-GGTR14 pisaptens] 23 
28595 U31875 H&J272499 shortchah alcohol o^hy*ogerasefemBy member 12.1 
28599 NM.015366HS.102336 RhoGTPase activating protein 8 z3 
28604 AI879099 Hs.102397 GK7M forgonadofa^ hducibte transcription repressor^ 13 
28608 BE267994 Hs.102419 ara: finger protein 7.1 
28625 AB037841 Hs.102652 hypothetical profek) ASH1 1-3 
128629 AL096748 Hs.102708 DKFZP434A043 protein 32 
28639 AW582962 Hs.102B97 CGW7 protein 23 
2B656 AA458542 Hs.10326 coatorner protein complex, subunaepsDon 1-4 
28656 AA458542 Hs.10326 coatoroer protein complex, subunSepsBo 13 
28658 BE397354 Hs324830 (fiptheria toxin resistance proteh required for dphmarrdde biosynthesis (SacdrarorrycesHfte 2 Z4 
28670 AA975486 Hs.103441 Homo sapiens, StmBar to RIKEN cDNA 1700O10L19 gene, done MGC:16214, mRNA, complete cds 7.1 
28691 W27939 Hs.103834 rrypdhefcai protein MGC5576 7.7 
28695 BE081143 Hs£25977 mrieajiece^coac8wtor3 33 
28700 Y15221 Hs.103982 smal hducfote cytokine subfemDy B (Cys-X-Cys), member 11 13 
28714 TB5231 Hs.179661 fcibufin,beta5 73 
28717 AK0O1564 Hs.104222 hypofliefical proteh FU10702 53 
28733 BE147740 Hs.104558 ESTs, fctoofciatety sirriaar to 138022 hyj)c*efcd proteh fH^apiens) 2.7 
28737 AF292100 Hs.104613 RF42homolog 23 
28742 AA307211 Hs251531 proteasome (prosome, niacropeJh) subunil, alpha type, 4 4.4 

28746 AI470163 Hs323342 acfin related protein 213 conplBX.subui*4(2b>tD) 22 

28747 AB027249 Ks.104741 PDZ-bhc^ lonase;T^on#rate^ 23 
28772 BE3Q2796 Hs.105097 thymkflw Whase 1, stAAle 53 
28781 N71826 Hs.105485 smafl nuclear ribortucteqxo^ 535 
28797 NMJHE975HS.1059Z7 stem erf growfo fedon tyniptocyte 133 
28806 AW630942 Ns.106061 RD RNA-bhdng proteh 23 
28814 AW248431 H&256526 rnxdearpnahmh A recogr^kin factor 21 
28830 BE281170 Hs.108357 vaiosfriofitahhg protein 53 
28835 AKD01731 Hs.106390 Homo santera mRNA; cDNA DKFZp58&fl)924 (from done DKFZp586H0924) 13 
28854 BE159181 Hs.168232 tiypoftefcal proteh FU13855 22 
28854 BE159181 Hs.168232 hypothetical proteh FU13855 13 
28668 AA419006 Hs.106730 chromubuiie 22 open reading frame 3 33 
28868 AA419TO8 Hs.106730 ctoorf»scfne22c^reaa^frant93 22 
28871 AF189723 Hs.106778 ATPase, Ca++ transporfing, type 2C, member 1 13 
28891 F34856 Hs^92457 Homo sapiens, cte»(yiGW6362 i n^conipteteab 133 
28906 R57988 Hs.10706 epflhefet proteh lost hreopbsmbeta 47 
28920 AA622037 Ks.166468 prograrnrnedce8deafl)5 1*4 
28925 R67419 H&21851 Homo sapiens cONAFU12900fis, doneNT2RP2004321 13 
28946 Y13153 Hs.107318 kyroirenhe3-moroc(xyg8r^ 72 
28949 AA009647 Hs3850 a (Mrtegrh and metafloprote^ 24 
28958 AW139032 Hs.107376 hypofce&al proteh DKFZp434N035 13 
128959 AI580127 Hs.107381 hypofte&al proteh FU11200 103 
28965 AW150697 Hs.107418 ESTs 14 
i28970 AI375672 Hs.165028 ESTs 13 
28975 BE560779 Hs^84233 KICE^5 proteh 143 
28979 AW271217 Hsl281434 Homo sapiens cOrMaJ14028fis,ctor»HEMaA1003838 13 
289% AI816224 Hs.107747 DKFZP566C243 proteh 13 
29019 AJ950087 gbwq05c02j(1 NCLCGAPJ<ld12 Homo sapiens cDNA done 3", mRNA sequence 23 
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129021 AL044675 Hs.173081 WAAQ530 protein ™ 

129021 AL044675 Hs.173081 WAAQ530 protein ^ 
I29032 R80088 Hs.108104 uWquffirKan^igafing enzyme E2U 3 ^ tofv>04rU _. , 

1 29076 AW296806 H&326234 ESTs, Highly simter to T46422 hypothetical protein DKFZp434M2023.1 [Haptens] 5.0 

129078 AI351010 Hs.102267 lysosomal " 

1 29088 AA744610 Hs.194431 paSadto _ 7 

129095 112350 Hs.108623 thrombospondln 2 

129096 M463189 HsJ288906 WW DwnaWfcrrtaHng Gene f: 9 

129097 BE243933 Hs.108642 zinc finger protein 22 (KOX 15) 

I29099 AF146074 Hs.108660 ATP^MnrJng cassette, siikfernDy C (CFTR/MRP). member 5 j>° 

129136 WS3048 H&250723 hypothetical proNnMGC2747 J* 

129149 AA356620 Hs.108947 WAA0050 gene product J» 

129172 AW162916 Hs241576 hypotietol proteh PR02577 J» 

129192 AA286914 Hs.183299 ESTs frl 

129194 AA150797 Hs.109276 btexbiproteh ** 
129198 N57532 Hs.109315 WAA1415 protein 
129207 AI934365 Hs.109439 osteoglycte (osteoteducfive factor, mfrnecan) 

129228 U40714 HsJ239307 tyrcsyHRNA synthetase ~ 

129229 AF013758 Hs.109643 polyadenytate binding prc&inJiteracfing protein 1 

129254 AA252468 Hs.1098 DKFZp434J1813 protein 

129255 AS61727 Hs.109804 H1 WstonefamDy, member X 

129288 W26392 Hs.1 10080 ESTs, Weaidy simflar to S13495 pfegnancy zone r*otein [H.saptens] 9* 

I29296 AIQ51967 Ha. 110122 ESTs % 

[29323 AA287239 Hs£518 Homo saptens cONA RJ1 1311 fe, cfone PLACE1010102 5.1 

129340 H75334 Hs.11050 F-box only protein 9 ™ 

129347 BE614192 H&279869 melanon»«s«ia«ed anfi^ '* 

(29362 U3Q246 Hs.1 10736 solute carrier tamfly 12 (soflumfodassiumtok^ transporters), member 2 6.7 

I29366 BE220806 Hs.184697 Homo saptens done 23785 mRNAsequence J* 

I29370 A1686379 Ha. 11 0796 SAR1 protein ™ 

1 29372 NML016039HS.110803 Ca99 protein ^ 

129403 AF149785 Hs.1 11126 pfluflaiy tuiiw 4 r an sform&i g 1 interacting protein 

129404 A1267700 Hs317584 ESTs ™ 
J29404 AI267700 Hs317584 ESTs tf, 
129423 AA204686 Hs234149 rrypofl»ficalprotehFlJ20S47 
129449 A1096988 Hs.1 11 554 ADP-ribosytatjontactor-»e7 
I29453 AW974265 Hs.111632 Lsm3 protein *f 
I29482 AA188185 Hs289043 spoxftl J* 
I29482 AA188185 H&289043 sproffin *° 
129513 AW843633 Hs3Q6163 hypoftefical protein AL1101 15 ™ 
129515 AF255303 Hs.112227 membrane-associated nuctefc acid binding protein 

I29527 AA769221 HsJ270847 delta-tebuin « 

129559 W01296 Hs.1 1360 hypofltedcat protein RJ14784 

129560 AA317841 Hs.7845 hypoftetol protein MGC2752 

1 29570 AB23097 Hs.11441 Ghronttsane1openreao^ftama8 / 7? 

j 29575 F08282 Ha278428 progesflh Induced proteta f 1» 

129587 H14718 Hs.1 1506 Human done 23589 mRNAsequence 

129588 BE408300 Hs301862 postmeiofcsepegaBon increased 2-lflBB 9 1-4 
129591 N57423 Hs.179898 HSTOK5 protein J* 
129594 AW403724 Hs36989 ccap^jlato factor VII (senro ™ 
129596 AF035537 Hs.1 15521 RB/3 (yeast r>orretog)^c^^ 1-6 

129628 U38945 Hs.1 174 cycfin^tependentttr^inriib^ 22 
13628 U38945 Hs.1174 cydlrHleperidentkto ™ 

129629 AKD00398 Hs.1 1747 hypotofca) protein FU20391 

I29649 AD000092 Hs.16488 cairefcuDn **, 

I29675 NML015556HS.172180 WAA0440 protein J*} 

I29680 U03749 ajbttananchrarogia^ "r 1 

129689 AW748482 Hs.77873 B7hornobg3 

129702 AB04966 Hs.12035 ESTs,Wea«ysbTiBartoD8Q22hyrjoIheIk^ 7-4 

129720 AA156214 Hs.12152 APMCF1 protein 

129721 NML001415Hs^11539 aftaiyofetanstaltooin^^ " 
129726 H15474 Hs.132898 tetiyadddesaturasel &3 

129778 AKD01676 Ks.12457 hypofteflcal protein FU1081 4 2j 

129779 AA394090 Hs.12460 Homo sapiens done 23870 mRNA sequence 5.4 
I29800 AF052112 Hs.12540 lysosomal 

I29806 AB023148 Hs.173373 KIAA0931 protein ^ 

129815 BE565817 Hs^6498 hypoftefcal protein RJ21657 f\ 

129840 NM_006590Hs.12820 SnRNPassemb^defecbvel homotog ^ 
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29861 AL049999 K&65963 DKFZP564M182 protein 22 

29864 A1393237 Hs.129914 runheiatedtoarfi»|jrjDn 1.7 

29869 AI222069 Hs.130/15 hypcrftelical proteki sbhDartD mouse Dmipi 2.7 

29922 AFD42379 Hs.13386 gamma-tobuin complex protein 2 4.5 

29945 BE514376 Hs.165998 PAH mRNMtaflng protein 1-8 

299S3 AM12195 Ks.13740 ESTs 25 

29972 AW753185 Hs.180628 dynajrinl-TO© 1-8 

29983 U09848 Hs.132390 zinc finger protein 36 (KOX 18) 1-3 

29989 AB015856 H&247433 ac6va^transcrip8oniactor6 4.0 

3(X)10 AA301116 Hs.142838 luidec^pho^hoproteh Nopp34 13 

30081 AA287325 Hs.14713 ESTs 4.0 

30082 S73265 Hs.1473 gastrtn-retea^ng pepWe 13 
30097 AL046962 Ha/14845 fortheadbax03A 23 
30100 AL135561 Hs.14891 hypofteBcal protein FU21047 23 
130111 X530Q2 Hs.149846 Megrin,beta5 23 
30112 AA916785 Hs.180610 spDdhg fector prollne^lutanihe ric^ 3.0 
30112 AA916785 Hs.180610 spfidng factor proSnefelutenAiench( 2.1 
30128 L76937 Hs.150477 Werner syndrome 13 
30135 AA311426 H&21635 tubuSn, gamma 1 61 

30211 NM_003358Hs23703 ESTs. Moderately dmflar to CEGTJ1UMAN CERAM1DE OOCOSYL7RANSFERASE [H^apiens] 1.6 

30212 D80001 Hs.152629 K1AA0179 protein 13 
130236 R85367 Ha31957 spBdng 6«ta, arghtoe&eri^^ 2.0 

30241 AL035588 Hs.153203 1^ fem3y inhibtor 3.2 

30242 X79201 Hs.153221 syra^sarcorr^transkx^ 5.4 
30249 D81983 Hs3228S2 GAS2^efatedonchrornosoma22 4.8 
30263 NM_002497Hs.1537O4 NIMA (never htiflosisc^»aHelatedltoase2 1.4 
30287 AA4790O5 Hs.154036 tumor suppnssslng suttransferabte candidate 3 2.6 
30310 AB011121 Hs.154248 amyotrophic lateral sdero^ 63 
30353 Z19084 Hs.172210 MUF1prote&i 6-2 

30356 AF127577 Ha.155017 riudear receptor Intera^ protein 1 2.4 

30357 AJ224442 Hs.155020 pufefive meftyftransferase 3.4 
30359 W_013449Hs277401 toorrxxtornajhao>cen!tozte 63 
30367 AL135301 Hs3768 hypothetical protein FU10849 1.4 
30372 AI077464 Hs^Oll RNA NnrJhg mofif protein 9 33 
30393 N89487 Hs.155291 WAAD005 gene product 18 
30399 AW374106 Hs.155356 hypcfte^rprori*MG<2840 3.4 
30407 BE385099 Hs334727 byprihe&a) protefci MGC3017 23 
130409 NNL001197KS.155419 BCL2-lnteracfe)g Wter (apc^AJsis^rxfticing) 2.7 
130419 AF037448 Hs.155489 NSI-associated protein 1 16 
.30441 U63630 Hs.155637 proteh kinase, DNAaftrated. cafary8cp<3lypepfide 23 
130448 BE513202 Hs.15589 PPARbhdtog protein 35 

3)455 D90O41 Hs.155956 riscsiyftransftsase 1 (a^^ 333 

30455 D90041 Hs.155956 r^ce^Bransfeiasa 1 (arytambie N^ce^f 4.6 

30471 AL121438 Hs.183706 addudn 1 (alpha) 2.7 

.30485 BE245B51 Hs.180779 HfflhstonefemfyrrernberB 5.0 

,30487 U49844 Hs77613 ataxia telangiectasia and Rad3 related 43 

30498 L38951 Hs.180446 kaiyopheite (frnportin) beta 1 13 

30503 BE208491 Hs^95112 WAA0618gene product 10.1 

30511 132137 H&15B4 cartilage djyjmerfcmatnx 6.1 

130511 L32137 Ks.1584 tankage cSajmeric matrix pr^ 53 

30526 AW876523 Hs/15829 hypofrefcal protein FU12910 2.1 

30542 U64675 Hs.179825 RAN Wnolng protein 2-Bte 1 73 

30544 AA321238 Hs.4310 euKaryc^trarislator^lato factor 1A 13 

,30553 AFD62649 \b252SBr pluitarytumor-ta^^ 14.4 

30556 A1907018 Hs.15977 Empktra^setectedtaAFr^sIr^prob^ 4.7 

30567 AA383092 H&1608 repfcafionprotefc»A3(14kD) 73 

30568 AA232119 Hs.16085 putafoe Gi*oteto coupled receptor 33 
I30574 AF083206 H&16178 apor^osfe antagonizing uari^^ 1-2 
30586 AB0O7891 Hs.16349 HAAD431 protein 53 
30598 ALD42210 Hs.16493 hypolhefcal protein DKFZp76^2316; WAA1803 protein 1.4 
130601 AA609738 Ks.16525 ESTs 13 
130614 AJ354355 Hs.16697 dc«n*egutata<rftrareaPon1, W 13 
,30617 M90516 Hs.1674 gtotBrnfofrftuctose^ 12.1 
130617 M90516 Hs.1674 gjutarririe^riictose^Jb^^ 2.4 
30618 AA383439 Hs.16758 Spir-1 protein 153 
,30667 BE246961 Hs.17639 rtorno sapiens tibtyffinpr^ 133 
130674 AL048842 Hs.194019 attractin 13 
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130675 AA442233 Hs.17731 hypomeiica! protein RJ12892 ^ 

130692 AA652501 Hs.13561 hypoftetlcal proteh MGC4692 5.0 
I30893 R68537 Hs.17962 ESTs 

130712 AJ271881 HsJZ79762 broniodomahKxmtainbg 7 ™ 

130714 AB48274 Hs.18212 DNAsegmema»dH0jnc50iraX(^^ 25 

130730 ABQ07920 Hs.18586 KIAAD451 gene product 3.7 

12)744 K59696 Hs.18747 POP7(processtogcf preai^ *1 

SJ751 AFO52105 Hs.18879 chjornosoitB 12 open rcatfing frame 1-4 

130757 AL036067 Hs.18925 protein x 0001 5.7 

j 30768 AF256GZ7 Hs211562 ATWwio^ig cassette, sub*n^ 5 j[ 

I30789 AKD00355 Hs5899 sWuh(sBertma^1ypeMor^^ 52 

130815 ABOIBSB Hs.19822 SEC24(acefev^)fe^g^fan% l mejT*erD 15 

130836 J05068 Hs3012 trarecobalamin J (vitamin Btt 15.7 

130841 AL157488 Hs525825 Homosap^d)NAFU20848fis,ck^ADKA01732 2Ji 

130843 AA447492 Hs^0183 ESTs, WeaMysknSar to AF1647931profteirix 01 3 [Haptens] 15 

130844 U76248 Hs£0191 seven bi aJbsento(DcosopMa)homDlog2 3.4 
130855 AJ243706 Hs.143323 putafive DNAtoromatfo tainting molif 1-7 
;30861 NM_016578Hs^0509 HBV pX associated proteM 1-9 

130879 NKL003416HS5076 2mcfmg^pfdeh7(KCa4 l cteneHF.16) 1-4 

130880 BB14434 HsJ20830 WnesJrvfte2 £« 
130892 AL120837 Hs3Q993 hjgh-gJucoseHregubted protein 8 ^ 
130898 AB033078 Hs.186613 sphlngosfri8>1-phosphatB lyase 1 1.7 
130911 BE409769 Hs^1189 DnaJ (Hsp40) homoiog, subfemflyA t member2 15 
130919 K79110 Hi21276 coBag8n,type IV, alpha 3 (Goodpasture anfyen)binoTng protein 25 
130944 BE382657 H&21486 signal transducer and acfivatordtransc^^ 5.4 
130971 N39842 Hs501444 WAA1673 22 
I30932 BE398091 Hs.74316 desrrcpbHn (DPI. DPU) ]•? 
130993 T97401 H*21929 ESTs \\ 
131005 AV858308 H&2210 ttfyroid hoimone receptor Weractor3 ]* 
131028 AJ879165 H&2227 CCAAT/enhaiwKr^pr^ 1-2 
131042 AI826288 Hs.171637 hypofi>e^ protein MGC2628 15 
131046 AA321649 Hs2248 srraB b&Jdbte cytokfoe su^^ 7.4 

131046 AA321649 Hs2248 smafi inducible cytokine subfamily B (Cy 3 0 

131047 H23230 hfe^2461 ESTs, Mortsrately slmflarto A46010 X«Bd retinopathy protein [H^apiens] 1.7 
131060 AA194422 H&22554 myoshVI 5.1 
131060 AA194422 H&22564 myosin VI 25 
{31070 N53344 Hs22607 ESTs 

131076 AA749230 H&26433 dcWf>tosphate (UD^^ 2.0 

131076 AA749230 Hs^6433 c^feiiyVihosphate (UDP-Hecetylgkiccsam I- 9 

131099 AL133353 Hs226581 COX15(yeasQhomc^(*tD^ 7.0 

131174 NM_0O654OHs^9131 mictear receptor coacfivator 2 1-9 

131185 BE280074 H&23960 cycfinBf 5.8 

131206 AW138839 Hs34210 ESTs f° 

131213 AA885G99 H&24332 CGi-26 protein ™ 

J31225 H62087 Hs31658 thyrokJ hormone receptor^^ 75 

131231 N47468 Hs59757 zinc finger protein 281 25 

131233 D89053 Hs£68012 feffly-adaMierayriaABgas^ 35 

131243 AW383256 Hs34752 spectrin StoAimafobteaTngpiotebl 25 

131245 ALD80080 Hs^4766 foforedjxinabmiin-con^^ 25 

131247 AL043100 Hs526190 fatty add amide hydrobse 55 

131281 AA251716 Kl25227 ESTs 5J 

131283 XB0038 Hs339713 Hano sapiens ctoneF19374APO Eugene duster 15 

131305 AV656017 Hs.184325 CGI-76 protein 5.0 

131320 AA505691 Hs.145696 spDdng factor (CC15) \\ 

131339 APD58698 H&25812 Nijrnegen breakage syndrome 1 (nfcrin) 2.6 

/31339 AF058696 H&25812 Nfcnegen breakage syndrome 1 (rdtofri) 25 

131375 AW293165 Hs.143134 ESTs 5.4 

[31390 BE269388 Hs.182698 mitochondrial rfbosomal proWnL20 55 

[31410 BE259110 H&279836 HSPC168 protek) 22 

131412 NM_012247Hs.124027 SBB40PHOSPHA1E SYNTHETASE ; Human selenium donor protein 25 

131429 AL5463Q2 Hs2d750 hypoftetal protein FU21908 I- 4 

131458 BE297567 H&27047 trypothetk^ protein RJ20392 1-7 

131475 AA992841 H&27263 HAA145B protein 25 

131501 AV66195B HsJBSOf GKD01 protein 25 

131501 AV661958 Hs5207 GKD01 protein 15 

131511 AA732153 Hs27865 Hccrx> sapiens d)NA:ai^ 25 

131528 AU076408 H&28309 UDP-gtuoose c^hydrogenase 16 
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131532 BE268278 H&28333 hypothdical protein MGCS92 

131543 AW966881 Hs.41639 pfogrammed ceB deafii 2 

131544 AL355715 Hs.28555 programmed eel deaft 9 (PDCD8) 
131562 MNL003512Hs^8777 H2AhbtDn8fam3y, member L 

131564 T93500 Ha28792 Homo sapiens cDNAFUl 1041 fis,dor»PU\CE10O4405 
131564 T93500 Hs^8792 Homo sapiens cDNA FU11041 fis, done PL 
131569 AL389951 Hs.271623 nucteoporfnSOW) 

131618 BE393822 Hs^9645 Homo sapiens mRNA; cDNA DKFZp761C029 (from clone DKFZp761C029); partial cds 

131622 R78195 Ha39692 Homo sapiens cDNA FU11436 fis, done HB©A100t213 

131623 AB037791 Hs^9716 hypo&re8caiprotBlnRJ10980 
131623 AB037791 HsJ29716 hypofite^ protein FU1Q380 
131643 AW410601 Hs30026 HSPC182 proteki 

ESTs 

KIAA08M protein 
uraci-DNAg!ycosybse2 
K1AA0124 protein 
putative DNAbfenftig profit 

131722 vww hsa»i phcsphafcosylpynJcJ^^ 

131737 AKD01641 Hs31323 fohBto of kappa BgWpor^^ 
131760 X76732 Hs3164 nucteobhdh2 
131760 X76732 Hs3164 nucteoblrafin 2 
131763 AIB76932 Hs317 topoisomeTase (DNA) 1 
131772 AA3825S0 Hs.170980 WAA0948 protein 
131774 BE267158 Hs.169474 DKFZP586J01 19 protein 
131787 D87077 Hs.195275 KIAAQ240 protein 

131793 AW9661Z7 Hs32246 Homo sapiens cDNA RJ14656 fis, clone NT2RP2002438 

131795 BE501849 Hs32317 hfchwnoblty group 2DB 

131798 X8609& Hs301449 atterK^5E1AbWhgproteh 

131817 U20536 Hs.3280 caspas86,apoptosb-reiaJBdcystefneprcteas8 

TATA boot b&ufiog protein (TBP)-assodated factor, RNA polymerase III, GTF3B subunft 2 
ESTs 

hypofcefcat protein MGC3178 
ESTs 
ESTs 



131653 AW960597 Hs30164 
131656 A1218918 Hs302Q9 
131669 X52488 
131692 BE559681 
131714 AA642831 
131722 D13757 



HsJHMI 
Hs30736 
Hs31016 
Hs.311 



131824 U28838 Hs32935 

131850 AB51317 Hs33184 

131878 AA083764 Hs.6101 

131885 BB02341 Hs3402 

131885 BBQ2341 Hs3402 



35 131887 W17064 Hs332848 SWUS^idate^matfe associated, 



131900 AA099014 Hs^31Q29 Hc«» sapiens, clone MGW5961, rrflNA. complete cds 
131900 AA099014 Hs^31029 Homo saptens, done MG&159S1, mRNA. com 

131904 AFQ78866 Hs^84296 Homo sapiens CDHA.-RJ22993 fis, doneKAT11914 

131905 AA179298 Hs.3439 stomatin-0ke2 

131913 AW207440 Hs.185973 o^geneiatrirespenratocyte{rKXTO^ 
131916 AA025976 Hs34569 ESTs 

131925 AF151048 Hs.183180 anaphase piorno&ig amplex subwft 11 freest APCTh 
131929 BE541211 Hs.34804 Homo sapiens cDNA RJ11472 fis, done HEKEA10O1711 
131941 BE252983 Hs35086 ubiquifo specific protease 1 
131950 AA355113 Hs35380 xOOIproteh 
131962 AKD00046 Hs>2b7448 hypoftefical prate* FU20039 
131965 W792B3 Hs35962 ESTs 
131971 BE567100 Hs.154938 hypcfliefcal rxotefo M)SQ25 
131977 U90441 Hs3622 pnxoSagenfioltea, ^ 
131985 AA503G2D HsJ6563 m^etk^ protein FU22418 
131991 AF053306 Hs36708 buddbg unhhforted 
132019 H56995 Hs37372 Hon» sapiens DNAbfcta^rjeprj^ 
132031 AF193844 Hs3758 COP9compJexsubunft7a 

ctadvfn-assodated proteki AP47 
kajyopherina^3fm)porfina!pha4) 

ESTs 
ESTs 
ESTs 

kineste4to1 

fibroblast acflvaflon protein, alptai 
rrdtochondrtel ribosomal protein L37 

~ — ESTs 

132203 NM.004782HS.194714 synaptosoroalassodated proteh,29fcD 
132207 BE206939 Hs.42287 E2Ftranscrip8on tactor 6 
132235 AV658411 Hs.42656 WAA1681 protein 
132240 AB018324 Hs.42676 WAAD781 protein 

132252 A1566004 Hs.141269 Homo sapiens cDNA: FU21550 fis, done COLD6258 

132 



132062 BE266155 H&3832 
132084 NlyL002267Hs3866 
132103 BE171921 Hs3991 
132105 AV646076 Hs39959 
132116 AW960474 Hs.40289 
132176 AAB57Q25 Hs3&78 
132180 NKL004460H&418 
132192 AA206153 Hs.4209 
132194 R42432 HiU212 



7>4 
22 
Z1 
1.7 
5.1 
13 
5.0 
13 
13 
22 
13 
23 
13 
23 
23 
53 
23 
3,4 
33 
23 
23 
3.4 
253 
53 
2.4 
73 
1.4 
4.1 
A2 
33 
5.1 
53 
13.7 
2.4 

12 
&7 
23 
53 
113 
1.7 
$2 
Z7 
53 
23 
13 
Z3 
1.4 
33 
63 
Z4 
2.1 
3.2 
53 
13 
3.7 
1.4 
53 
1.7 
33 
14.7 
53 
4.4 
22 
22 
73 
1.5 
13 
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32266 AA301228 Hs.43299 hypolheBcal proteh FU12890 5.7 

32273 AA227710 Hs.43658 DKFZP586L151 protein J-2 

32276 AA653507 Hs^85711 hypothefeaJ proteh FU13089 2.1 

32288 N36110 Hs305971 solute canter family 2 (tacffitated glucose transporteO, member 10 15 

32294 AB023191 Hs.44131 KIAAQ974 protein ] 00 

32298 NM.0159S6Hs.7120 cytokine receptor -lite motecule 9 1-9 

32299 AW405882 H&44205 cor&tatm * 2 
32325 N37065 Hs.44856 hypofoetalprotelnRJ12116 25 
32348 AW067708 Hs.170311 heteiogenecwiwclearrtT5Cfluctec^ 65 
32370 AW5728Q5 Hs.46645 ESTs 

I32374 AF155582 Hs.46744 ccrel UDPgatactosetta^^ 1-5 

I32376 AI279892 Hs.46801 sorang nextn 14 125 

I32384 AA312135 Hs.46967 HSPC034 protem 283 

132393 AL135094 Hs.47334 hypc*teta» protein RJ14495 15 

32450 AA100012 Hs.48827 hypotieikrfprotehFU12085 ]5 

32452 AW973521 Hs247324 rrtaroxM riboson^ proteb 6.1 

32456 AB011084 Hs.48924 KIAA0512 gene product ALEX2 1.7 
32465 AW169847 Hs.49169 KIAA1634 protein 

32470 AI224456 Hs.4934 Usaptens pdyA ste DMA W 

32484 X16660 Hs.119007 RAB4. member RAS oncogene famfly 1.4 

132518 AW885606 Hs5064 ESTs ™ 

I32528 T78736 Hs50758 SMC4 (structural matetenance of criranosornes 4, yeasty 1 13 

132530 AA306105 H&50785 SEC22, vesicte trafficking pr(to(S.03revislae>te1 25 
132532 AA454132 Hs5080 ntehowm^ribceoi^ 

32534 BE388673 Hs5086 hypo»^caIptote&iMGC10433 £2 

32543 BE568452 H&5101 proieh regulator dcytokmesisl 73 

32571 AW674699 Hs5169 suppresses (2 aBete of SK^ U 

32574 AW631437 HsS184 7H1 drosophfa homotog 7.1 

32596 AKD01484 H&5298 CGM5profen J2 

32611 AA345547 H&53263 rrypot)etical proton RJ13287 . £2 

32612 H12751 HaS327 PR01914 proteto 65 
132616 BE262677 Hi283558 rr/potiefcal protein PR01855 145 
132638 AI796870 Hs54277 DMA segrr^c« chromosome X (unique) 9928 expressed sequence 11.4 
I32648 U51127 H&54434 r^pdhe^ protein MGC1715 15 
32668 AB018319 Hs5460 WAA0776 protein 2-6 
32692 AW191962 Hsl249239 cotegen, type VDl. alpha 2 25 
,32715 F11875 Hjl5534 Hai» sapiens cDHARJ12961 As, ctone NTOT20Q5645 15 
132718 NMJD04600Hs554 Sjogren syrKtnxre arisen A2(601A 3.0 
32724 A1142265 HsJ55498 geranylger^(M»sphatesynftase1 24 
32731 AI189075 H&301872 hypofcelca! protein MGC4840 2.4 
132744 AA010233 Hs55921 ghitamyH)ro^WW syrrmetase 146 
32760 AA125985 Hs56145 toymosfc beta, identified tn neurctbstDrna ceb 2.7 
32771 Y1Q275 Hs56407 phosphoseiijectephalase 35 
32773 AA459713 H&295901 WAA0493 protein 25 
32784 A1142133 Ks56845 GDPtfssocbflonlnha)ftor2 15 
32798 AK26701 Hs5716 WAA0310 gene product 3.7 
32807 U07418 Hs57301 niut(E.cci) homotog 1 (cobncaricer.iwripoJy^ 15 
132810 AB007944 Hs5737 WAA0475 gene product 55 
132813 BE313625 Hs57435 scte carrier faniry 11 (r^rotor^^ 8-7 
32815 A1815189 Hs57475 sex (^cnmkileg homotog 1 6.4 
32817 N27852 Hs57553 toustedflce kinase 2 3.6 
32821 A1S1595 Hs.169610 aui^en (tornlng furcfiw 25 
32833 U78525 Ks57783 eufcryofc translation mit^^ 145 
32842 NMJM6154Hs2re771 rtarr©sapkrector»PP159Sunkr^ 15 
32844 F12200 Hs5811 chrcroosoiro 21 reading tran^ 25 
I32B51 U08716 H&287912 tecBn. n*mriose4)todtog, 1 14 
132663 BE268048 H&236494 RAB10, member RAS oncogene family 42 
32869 AW963217 Hs^03961 ESTs, Moderately similar to AF1 16721 89 PR021 ® [Haptens] 2.8 
132873 AWD07683 Hs58598 WAA1266 proteto 2.0 
32875 NMJXM850Hs58617 Rrxwssc«bted,coteckx)Bccfflbtoto^ 15 
132891 BE267143 Hs59271 U2(RNU2) small miclearRNAaux^ 1.4 
I32B97 AW5Q3667 Hs59545 ringfinger protein 15 5.4 
[32902 AI936442 Hs59838 hypomefical protein FU10808 6.1 
[32912 AW732760 Hs.167578 Homo sapiens cDNA RJ1 1095 fe, done PIACE1005374 7.1 
[32913 W78714 Rs.60257 Homo sapiens cDNA RJ13598 fe, ctone PLACE1009921 25 
[32940 T79136 Hs.127243 Homo sapiens mRNA tor WAA1724 protein, parfel cds 6.1 
132941 AI817165 Hs.6120 hypothetical proteto RJ13222 105 
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32942 AA554458 Hs.197751 WAA0666 protein 1- 8 

32952 AIS58580 Hs31426 rtonx>sap^rnesemirymalstemc^ 22 

32962 AA576635 Hs.6153 CGM8 protein 45 

32972 AA034385 Hs288924 Homo sapiens cDNA FU11392 6s, done HEMBA10C0575 27 

32973 AA035446 Hs323277 ESTs w 
32977 AA093322 Hs.301404 RNAbmaTngmofif protein 3 " 
32980 AAD40696 Hs.62016 ESTs U 
32994 AA112748 Hs£79905 doneHQ0310PRO0310p1 3.0 . 
133012 AA847843 Hs32711 Homo sapiens, done IMAGE:3351235. mRNA 103 

33015 AJ0Q2744 H&246315 UDP44*cetyKafc>ha4>galactosa^ 21 

33016 AI439688 Hs.6289 hyprthefcal protein FU208B6 13 
133053 AW65016 Hs.6390 Homo sapiens done FLB3344 PRO0845 mRNA, comptefecds 6.0 
133062 AW500374 Hs.64056 PRO0149 proteifi 53 
33069 BE247441 H&6430 protein wflh pdygtutamine repeat calcium (ca2-^ homeostasis endoplasmic reticulum protem 43 
33091 AKD01628 Hs.64691 WAA0483 protein 33 
33110 AA80B177 Hs35228 ESTs ^ 1 
33134 AF19B620 Hs.65648 rWbWfogrnoff protein 8A 1-3 
33145 H94227 Hs3592 Hai»sapierts,dorolMAGE^1368,rnR^ 22 
133152 Z11695 Hs324473 m&oo^rKid»atedpT^ 1-3 
33174 AA431620 Hs324178 hypcttefcal protein M6C2745 17.1 
133175 AW955632 Hs36666 ESTs» WeaWysirn2artoS1K60prcJine^pro^ MP4 - mouse [Mjnuscutus] 13 
33177 X9779S Hs36718 RAD54 (SxerevtsfaeHftB 43 
33197 AI275243 Hs.180201 hypofteflcal protein HJ20671 3.1 
133208 AI801777 Hs3774 ESTs 44 
33226 AW954569 Hs286287 Hc>nx> sapiens, SiniTar to bromo^^ 4, done IMAGE3542455, mRNA, partial cds 17 
33228 A1492924 Hs3831 golgl ptosprtoprotein 1 63 
33240 AKD01489 Hs242894 ADP-ribosytefion factorfke 1 1-5 
33254 A1567421 H&273330 Horm>sapier^GtortelMA6E^544D^mRN^ 1.4 
33266 AI16Q873 Hs39233 zinc finger protein 53 
33268 AW956781 H&293937 ESTs, Weakly sknflar to FXD2_HUMAN FORKHEAD BOX PROTEIN D2 [H^apiensJ 13 
I33285 M76477 Hs289082 GM2g^Soskteac8valorproyn 47 
133291 BE297855 Hs39855 NRAS*etated gene 53 
133314 AA102670 Hs.70725 o^nima-aminobutyifc 27 
33321 T79526 Hs.179516 integral type I protein 9-3 
I33327 AL390127 Hs.7104 KnippeWke factor 13 4.4 
33347 BE257758 Hs.71475 acM duster protein 33 1-6 
133360 AI016521 H&71816 v^murtoettiymcra viral orax^^ 53 

33366 AA292811 Hs.72050 rwHnstasfefccefe 5, protein expressed In (iwdecsWeHflphosphate Wrase) 27 

33367 AF231919 Hs. 18759 WAA0539 gene produd 17 
33370 AF2455Q5 Hs.72157 DKFZP564I1922 proten 1-B 
33383 BE313555 Hs.7252 KIAA1224 protein 17 

33390 AI950382 Hs.72660 phosphaQoVlserine receptor 1-3 

33391 AW103364 Hs.727 M>^betaA(ac8^8C&fr)^ 16.1 
33394 AA305127 Ha237225 hypoftefical protein HTO23 122 
33437 AL031591 Hs.7370 phosphofir^tottut transfer protein, beta 10.4 

33452 NH.002759Hs274382 protefaMnase,hterfeion-hA^ 12 

33453 AI659306 Hs73826 pn^ tyrosine r^wspte^ 17 
33500 AW964804 Hs.74280 hypoQie8ca)protehHJ22237 11-1 
33529 W45623 Hs.74571 ADP^fcosyb&n factor 1 23 
33540 AL037159 Hs.74619 proteasomerj)rosorne t macropajn)26S 23 
33543 AU077073 Hs.106327 o^n^e^pec^rmblm^prDknl (12710) 23 

33578 AU077050 Hs.75066 transfin 

33579 X75346 Hs.75074 rnftogen-actrvated protein toase-acfl^ protein kSrase 2 21 
33582 BE391579 Hs75087 Fas^tcirvaledsertoeAivBonte 1-3 

33594 AW160781 Hs.172589 rtudearphosptoprotehstototo&c 22 

33595 AA393273 K9J5133 transcript factor 6MD© 1 (rntochondrfal transcrfpfion factor 1-Dke) 13 
133599 NNL002885HS.75151 RAPl.GTPaseatSvaflng protect 5.7 
33621 NIAJD04893HS.75258 H2AWstonefan%. member Y 253 
133627 NNL0Q2047Hs75280 aJycyV9^syrdhetase 153 
133631 WLP00401Hs75334 exostoses (murtyte) 2 33 
I33649 U25849 Hs.75393 add phosphatase 1, soluble 1-6 
133690 AV661185 Hs75574 iritoduto^rirjosomd 4.1 
133720 L27841 Hs75737 r^rxsrtfrfotar material 1 1-5 
133722 AW969976 H&279009 matrix ©a protein 63 
33751 AW402048.HS334787 rtemosa^Strrilartoiik^ 33 
,33757 T52946 Hs.196209 RAE1 (Rr^expc^1,Swpornbe)ht)molr^ 17 
133760 BE271766 Hs.181357 bmtoto receptor 1 (67kD,tf>osc^ 13 
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133765 M62134 Hs.75929 cadherin 11,lype2,OB<adh^(osteoblasQ 13 

133780 AA557660 Hs.76152 decorfn 35 

133784 BE622743 H&301064 arfapfti 1 6-8 

133791 M3433B Hs.76244 ^wnwfine synthase 2* 

5 1337S7 AL133921 Hs.76272 reflnobfastoma^lndl^ proifiln^ 1.4 

133822 D5052S Hs.699 peptidyiprotyl iscmerase B (cycJophHln B) 83 

133842 AW797468 K&285013 putaflve human HLA class II associated protein I 115 

133845 AA147026 Hs.76704 ESTs 22 

133850 W29092 Hs.7678 ceWarT^nofcackM)b<^ protein 1 13 

10 133859 U86782 Hs.178761 26Sproteason»^ssoQatedpad1 homolog 2J> 

133865 AB011155 Hs.170290 (fees, large (DrosophBa) homotog 5 23 

133867 AW340125 Ha.76989 K1AA0097 gens product 6J 

133868 AB0121S3 Hs.183874 cuto4A 25 
133881 U30872 Hs.77204 centrorrra protein F(35GW^ • 3.0 

15 133922 1)30825 HsJ7608 spicing factor, arginhe/$erhe^tti9 1.4 

133924 D86326 Hs325948 vesicle docking protein p115 5.4 

133929 NKL006306Ka2116Q2 SMC1 (stnjchialmaMeranrac^ 43 

133936 L17128 H&77719 gamrna-gWanryicaftJOxybse 3J 

133941 BE244332 Hs,77770 ao^ptDHrelated protein conptex3.mu2sutenft 12.1 

20 133959 X81789 Hs.77897 spBc^fador3a,subuna3.60U) 9J 

133976 AI908165 Hs.169946 GATA-Wndlng protein 3 (T-cefl receptor gene acflvator) 3.1 

133989 AL040328 Hs.78202 SWl/SNF related, matrk associated, aiin o^jerxJeni regulator of chrornatin 13 

133997 AI824113 Hs.78281 regubtorof Gi»rotehs|gnaIIhg12 9J 

134010 AB016092 Hs.197114 RhW bmd^ r*ctfiirs ATwtch 2.4 

25 134015 D31764 Hs278569 sorfrg nadn 17 25 

134070 NKL003590HS.78946 cuJDh3 13 

134110 1)41060 Hs.79136 UV-1 protein, estrogen regulated 42 

134129 NMJH4742HsJ93G5 WAA0255oene product 22 

134134 H86504 Hs.173328 piotehprwspriatase^ regufato^ 5J) 

30 134200 BE559598 Hs.197803 KIAAM60 protein 32 

134206 AF107463 Hs.79968 spBdogfadt)r30,survivaJc4rndorne^ 2£ 

134208 NM.000288HS.79993 peroa^ornal biogenesis factor 7 2.1 

134219 NMJ30040m8Q208 glucose-Sfhosphafe dehydro ge nase 9.1 

134234 BE300078 Hs30449 Hoito sapiens, dor* IMAGESS^ 23 

35 134275 AI878910 Hs3688 dsptafin resistance-associated overeq>ressed protein 13 

134292 AI906291 Hs31234 IrnmunogJobuIIn surjertamfly, men*er3 20 

134301 AW5Q2505 Hs31360 HorrK>sap^cDHA:FlJ21927^cbreHEP04178 > h 25 

134385 U61397 Hs31424 ubkpiffirvto 1 (sentrtn) 23 

134324 AB029Q23 Hs.179946 WAA1100 protein 10.4 

40 134326 AW903838 Hs31800 (tondro^ sulfate rjr^^ 13 

134329 M92036 Hs31848 RAD21 (S. pombe) bomoiog 23 

134337 NM_004922Hs319G4 SEC24(acerevlsfae)re^gene^ l meinberC 23 

134348 AW291946 Ha£2Q65 interieuto 6 signal transc^ 133 

134367 AA339449 Hs32285 r^tt sph oifec^ydnanTidB toi iryni ansfe i as B, prosrimnlxjs^lycgiarnkte synthetase, 83 

45 134376 X06560 Hs32396 7,?.c4tao8<Sem^syrrtiela^^ 13 

134379 AW362124 Hs323193 hypc*efcaJprotehMGC3222 8.1 

134384 AI589941 Hs3254 HorrwsaiJfens.Shtototumw 

134391 AA417383 Ks32582 Integra beWhe 1 (yrimEGWte repeat domahs) 4.1 

134395 AA456539 Hs3262 lysosomal 17 

50 134403 AA334551 Hs3Z767 sperm specific antigen 2 23 

134405 AW067S03 Hs32772 conagerHtypeXl.a^ha 1 13 

134411 BE272095 Hs.167791 reficukcalbin 1, EF-hand cak*um bmdTng dcmaln 32 

134415 AI750762 Hs32911 protein tyrosine phosphatase type IV A, member 2 1.9 

134421 AU077196 Hs32985 a>Bap^ type V, alpha 2 103 

55 134424 Z44190 Ha.83023 peradsomaJ biogenesis factor 11B 2.4 

134446 AA112036 Hs33419 KIAAQ252 proteh 12 

134447 M586Q3 Ks33428 miciE» factored kappa Ifc^ 13 
134470 X54942 Hs.83758 CDC28 protein kinase 2 2.1 
134480 NhifJ05000Hs33916 Empbtoa^setectedfaomAFFXslrQlBproceset 53 

60 134485 X82153 Hs33942 caftepshK(pycrtcdysc^osis) 23 

134498 AW246273 Hs34131 ftreonyHRNAsyn&ietese 2.1 

134513 AA425473 Hs34429 K1AA0971 proteh 33 

134516 AKD01571 Hs273357 hypometfcaJ proteh FU10709 2,4 <| 

134520 BE091005 Hs.74861 activated RKA polymerase B traecrlptacofectoM 6.7 

65 134529 AW411479 Hs.848 FK506Jbtaflrtg rjro^ 23 

134577 BE244323 H&35951 exporfin, TOM (nuclear export receptorfortRNAs) 55 

134562 AA927177 Hs36041 CGG triplet repeat binding proteinl 53 
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134664 AA256106 Hs37507 
134666 BE391929 Hs3752 
134687 U62317 Hs38251 
I346S2 NML0O3474HSJ850 
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134834 AW451370 Hs3991 
134850 AI701162 H&9O207 
I34853 BE268325 Hs30280 
[34659 D26488 Hs30315 
134880 AI879195 Hs30606 
34910 AA532963 Hs3100 
34925 AW885909 Hs3975 



134612 AW068223 Hs.171581 uNquKn C-femninal hydrolase UCH37 
I34624 AP035119 Hs3700 deleted In fiver cancer 1 
I34632 X7B520 Hs.174139 chkxide channel 3 
i 34654 AKD01741 Hs3739 hypoftefical protein FU10879 
ESTs 

transmembrane protein 4 
arytsufiatase A 

mx*. , oti .i W t.t, w *— a distntegrtn and iretaDcproteinase domain 1 2 (meltrin alpha) 
134706 BE161887 Hs38799 anaphaseiffornoSng canpJexsutwnaiO 
134714 Y14768 Hs390 lysosomal 

134719 AA852S85 Hs39232 chrornobca homotog 5 (Drosophfla HP1 alpha) 
134722 AF129536 Hs284226 Fta on* proteto 6 
I34724 AF045239 Hs321576 ring finger protein 22 
134746 XTJ7871 Hs.89476 CD2anflgen (pSD). sheep red blood eel receptor 
134751 AW630803 Hs39497 bmtoBI 
134790 BEOQ2798 Hs28785D Wegral meni)rane protein 1 
1 34806 AD001528 Hs39718 spermtoe synthase 

adaptor-related prated complex 1,o^rara2suburet 
hypofefical protein MGC11138 

WAA0007 protein 
15 kDa selenoprotein 

Hum s api ens cDNA FU1 3100 fls, clone NT2RP3002255 
I343Z5 awb&oub ttsjwio PRO1073 protein 
134955 AW401361 Hs31773 prate* phosphatase 2 (forrri^ 
134971 AI097346 Hs286049 phosphisenneaimtulidikiifefase 
134975 R50333 Hs32186 l^ancolled-coflproteh 
135011 AB037835 Hs32991 WAA1414 protein 

I35022 KM_000408Hs.93201 glycerophosphate dehydrogenase 2 (mtochondiial) 
135032 AW301984 Hs.173685 hypcAeft^ protein FU12619 
135077 AW503733 Hs3414 WAA1488 protein 

i 35083 AB036063 Hs342B2 p53-hdudble ribcfmcteotJde reductase small subunft 2 homolog 

135095 AF0Z7219 Hs3443 2incftiaer protein 202 

135096 AAD81258 Hs.132390 zmcnn^protefn36(KOX18) 
135153 A1093155 Hs3542D JM27proteh 

135181 BE250865 Hs279529 pxIWteproteh 

135199 AA477514 Hs36247 toaiidtoassoctoted factor X 

135207 N26427 Hs3634 ESTs, Highly similar to C10_HUMAN PUTATIVE C10 PROTEIN [Rsapiens] 
1 35214 T78802 Hs36560 hypofte&al proteh FU11656 
1 35243 BE463721 Hs371Q1 putaSve G pioteirHaupted receptor 

135245 AJ028767 Hs2626Q3 ESTs «... 
135257 AW291023 Hs37255 ESTs, Weak* sfrnifer to M6010Xftitedieunopa%p 
I35263 AI088775 Hs35498 gerai^geranytdlphospr^sym^a»1 
135274 AA448460 Hs.112017 GE36gerre 

135294 AA150320 Hs.9800 protein kinase N)rnu-R1 

135295 AK80838 HsJ98006 ESTs 

135307 AI743770 Hs38368 ESTs,WeatyslmBartoKlAA0822proteto 

135321 AI652069 Hs38614 rB»sorrtf bir^ protein 1 (dog 180kD tomotog) 

135354 AA456454 Hs.183418 ceSdMston cycle 2-8)«1 fJ>rrSLr^ protehs) 

135361 AA373452 Hs.167700 Homo sapiens cDNA RJ10174 fis, clone HEMBA1003959 

135389 U05237 Hs39872 fetal Abheirneranflgen w ^ 

135400 X785S2 HsS9915 ardrogCT receptor (o^y*otesto^ 

134975 R50333 Hs321B6 Leman cdted-col protein 

135011 AB037835 Hs32991 WAA1414 protein 

I35022 KMJW0408HSJ93201 g^ajrofcHiriosD^ 

135032 AW301984 Hs.173685 hypo9ie&alprotBiiFU12619 

I35077 AW503733 Hs3414 WAA1488 protein 

135083 AB036063 H&94262 pSWndudbte rtoonucferide reductases 

135095 AFQ27219 Hs3443 zfatc finger protein 202 

135096 AA081258 Hs.132390 zinc finger protein 36 (K0X 18) 
135153 AIQ93155 Hs35420 JM27 protein 

J35181 BE2508G5 H&279B9 pxl&tte protefc 

135199 AA477514 H&96247 transln^ssociatedtectorX 

I35207 N264Z7 Hs3634 ESTs, HlgWy similar to C10 JtiJMAN PUTAT1 

135214 T78802 Hs36560 rrypofteto protein RJ11656 

I35243 BE463721 Hs37101 putafre 6 proteiKX*ipted receptor 

135245 AKJ28767 Hs262603 ESTs 
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135257 AW291023 Hs37255 ESTs, WeaHy similar to A46010 X-Med 
13S263 AI088775 Hs35498 gerany^eiaiTyltfiphosphatesyneBsel 
135274 AA448460 Hs.113r17 GE36gene 

135294 AA150320 Hs3800 proton kinase NJmu-RI 

135295 AKBQ838 Hs38006 ESTs 

135307 A1743770 Hs38368 ESTs, Weakly stoto to WAA0822 protein 

135321 AI652069 Hs38614 ribosome binding protein 1 (dog IBOkDho 

135354 AA456454 Hs.183418 crfdMston cyde 2^ke1 (PITSLREpr 

135361 AA373452 Hs.167700 Homo sapiens CDNAFU10174 fis, ctone m 

135389 U05237 Hs39872 fe& Alzheimer antigen 

135400 X78592 Hs39915 androgen receptor (aTtydrotestos^ 

302256 AA857131 Hs.171595 HtV TAT specffic factor 1 

302276 AW057736 Hs323910 HTO receptor tyrosine Hrase(c^«rt)-b2, 

303135 AW592789 Hs779474 HSPC070 protem 

303686 AKD00714 Hs.109441 MSTPQ33 protein 

310085 R43191 Hs.101248 Homo sapiens clone IMAGE32553, mRNA seq 

315518 AA808229 Hs.167771 ESTs 

317781 NMJW7057HS.42650 ZW10 Weractor 

320836 AB68997 Hs.197289 rab3 GTPasfracfiraflng protein, rarwafa 

321114 AA902S6 Hs.76979 Gofcl apparatus protein 1 

322221 N24236 Hs.179662 ruictecsome assefnWy proteh 1-&e 1 

322474 AF118083 H&29494 PR01912 protein 

322556 BED41451 Hs.177507 hypome&aJ protein 

323541 AF29210O Hs.104613 RP42hornotog 

407827 BE278431 Hs.40323 BUB3 (budding wtfnhfoSed by benzimktezo 

408196 AL034548 Hs.43627 SRY(sexd8tenrMning regkm Y)4)Ox22 

408813 A1560090 Hs.48295 RNA hefcase feniy 

409176 R73727 Hs.101617 ESTs.WeaWysbitotoT32527hypotiefi 

413670 AB000115 HsJ5470 rvpofte8cal protem, expressed mosteo 



414108 AB67592 HsJ5761 
414846 AW304454 Hs.77495 
416980 AA381133 H&80684 
417378 R57256 Hs32037 
418283 S79895 Hs33942 
418467 NMJJCS910HS-85273 
420269 U72937 Hs36264 
420802 U22376 Hs.1334 



UHXotornamHxmtalnmg 1 
WgtHnobffity group (nonhistone chramoso 
TATA box btoomg protein (TBPJ-assodate 
caOtepstn K (pyenodysostosb) 
refmobtastoma^falmg protein 6 
atphafltalas^ifemTentalretaro^tos^ 
avian myebclastosfevWat^ 
421225 AA463798 Hs.10896 MCT-1 protein 
421642 AF172166 Hs.106346 iB&xjteacHrepiessible protem 
421828 AW891985 Hs2797B9 rifetor»de3cetylase3 
421983 AI252640 Hs.110364 pepWylprctylson* 
422052 AA302744 Hs.104518 ESTs 
422055 NKL014320HS.111Q29 putaSve)ieine4)tndmgproteh 
423750 AF165883 Hil298229 prefokfin2 

424001 W67883 Hs.137476 paternal expressed 10 (PEG10; WAA105 

425182 AF041259 Hs.155040 zmc finger protem 217 

425284 AF155568 Hs.155489 NS1*ssociated protem 1 

426372 BE304680 Hs.169531 DEAD/H (Asp-Gt)-Ala-Asp/frfe) box potypep 

428049 AW1837G5 Hs.182238 GW128 protem 

428477 AW500533 Hs.11482 spDcmg factor, aigfnmetert»rfch 11 

DEAD/H (Asp^5fo-Ala-Asp/His) box potypep 
Homo sapiens riepatncf^iiarcaiclnoma-as 
Homo sapiens clone 23736 mRNA sequence 
skbl (S* pombe) hornolog 
446999 AA151520 Hs334822 hypomedcal protete M6C4485 
447111 AKM7574 Hs.17409 cysteine^ protem 1 (intestinal) 
447778 BB620592 Hs.71190 ESTs, WeaMysmto to ^6506 bypomrf 
446873 NM_003677Hs22393 densit)Heguiated protem 
449687 W68520 H&331328 mtemTeoTate&tenx^prc^SYncoin 
43)701 H39960 H&268467 Homosap»erocDHAFU12280^ckmeMA 
450703 AA011202 Hs>184771 mx^factor l/C (CCMT-bmcfing transc 
452461 N78223 Hs.108106 transcftpflon factor 
452511 BE408178 Hs£85165 Horn) sapler6cDNARJ20845fis, done AD 
453157 AP077036 H&31989 DKFZP586G1722 protem 
453658 BB41906 Hs37819 rtonxsapferis,doneM(^492,inR^ 
100833 AF135168 Hs.108802 N^rymiateh&te^ris^ factor 
102481 U50360 gbMuman calcium, ralrrxxWIiMiependent p 
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4375© AB001636 Hs3683 
438449 AKD01333 Hs3216 
441560 F13386 Hs.7888 
445580 AF167572 Hs.12912 
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102827 BE244588 Hs.6456 chaperonin containing TCP1,subunft2(b 
103549 BE270465 Hs.78793 protefo WnaseC.zeta 
104331 AB040450 Hs.279862 cdkfrih&aDrp21 blm&q protein 
110018 AW579842 Hs.104557 hypoftefca) protein FU10697 
5 115008 AKD01827 Hs.87889 heDcasfrraof 
119075 M10906 H&287820 fibionectinl 
119615 AL034423 Hs.75875 ublqiiffifKanjugalhg enzyme E2 variant 
12S006 BE065136 Ks.145696 spfcing factor (CC1 .3) 
127609 X80031 Hs£30 coOagen, type IV. alpha 3 (Goodpasture 

10 129209 R62676 Hs.17820 RhcKBsodatBd, coied-col containing p 
129917 M30773 Hs^78540 protBinphc«pratase3(fainKnyffl),reg 
130162 BE267033 Hs.192853 ubkjufflrKHijup^ 
130365 W56119 Hs.155103 euka>ofc&anste&Dnh8aiion facto 
131135 NM_016569Hs^67182 IBXHso protein 

15 131853 A1681917 Hs3321 ESTs, KpgHy stmSar to IRX1 J4UMAN IROQU 
131881 AW361018 H&3383 upstream regulatory etamert blndfrigprot 
13Z726 N52298 Hs£5608 rrypofce^ protein MGC955 
135193 X35525 Hs£6103 TATA box binding protein (TBP)-assodate 
409487 H19886 gkyi67a05j1 Scares aduttbfainN2b5H 

20 416040 AW819158 H&289044 Homo sapferecDNAfU12048fis. clone f« 
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Table 4A shows the accession numbers for those pkeys lacking unigenelD's for Table 4, For 
each probeset, we have listed the gene cluster number from which the oligonucleotides were 
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
mKNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" column. 

Ptey: Unique Eos probeset (denffier number 

CAT number Gene duster number 
Accession: Genbank accession numbers 

Pkey CAT number Accessions 



123615 
123619 
20 101445 
124385 
124417 
124482 



25 



103349 
110856 



120280 
113248 
120472 



30686.-15 

371681.1 

1650.-6 

656394J 

1642364J 

1657509J 

31281.-28 

U0SZ_-2 

19346.14. 



16021*_1 
328626.1 
44573^2 



AA609170 

AA6Q2964AA609200 
M21259 

AB67847N27351 
N34059 N46979 
N53935 N53950 
U50360 



40 129019 44573.2 



45 



120695 9683.3 



AA992380 N33063 N21418 H79958 R21911 H79957 

103797 109699 1 AA080912 AA075318 AA083403 AA076594 M078992 AA084926 AA081B81 AA1 13913 AA113892 
AA083821 AA134801AA082953 AA070343 AA062835 AA075419 AA063293 AA071252 M078900 AA0628& AW974305 

AA190577M181657 

T63857 AW97122D M493469 T63699 

AB50087N7Q208RS7O40 N368Q9AC08119 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671 
AA908598 AA251875 A&205O1 AB20532 W87891 TB5904 U71456 T82391 BE32B571T75102R34725AA884922BE328517 
AB19788 AA884444 N92578 F13493 AA927794 AK60251 AW874068 AL134O43AW235363AA663345AVW)08282AA488964 
AA283144 AI890387 AI950344 AI741346 A1689062 AA282915 AW102898 AI872193 AI763273 AW173586 AW150329 AB53832 
AI762688 AA988777 AA488892 AB56394 AW103813 A1539642 AA642789 AA856975AW505512 AJ961530AW629970 
BE612881 AW276997AW513601 AW512843 AA044209 AW856538 M180009 AA337499 AWS61 101 AA251669 AA251874 
AI819225 AW205862 AJ683338 Ai858509 AW276905 AJ633006 AA972584 AA908741 AW072629 AW513996 AA293273 
AA969759 N75628 N22388 H84729 H60052 TB2487 AMJ22058 M78M19AA551005W807M AVW13456AI373032AI564M 
R»531H83488W37181W788Q2R66056AK^^ 
AB5(^ M7O208 R9TO40 N36809 AI3081 1 9 

AAS08598 AA251875 AJ82D501 AJ820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 AAB84922 BE328517 
AI219788 AA8B4444 N92578 F13493 AA927794 AC60251 AW874068 AL134043AW235363AA663345AW008282AA488964 
AA283144 AI09O387 AB50344 AI741346 AI689062 AA282915 AW102898 AJ872193 AI763273 AW173586 AW150329 AI653832 
AI762688 AA988777 AA488892 AB56394 AW103813 AI539642 AA642789 AA856975 AW505512 AI961530 AW629970 
BE612881 AW276997AW513601 AW12843 AAO44209 AW856538AA180009 AA337499AW961101 AA251669AA251874 
AK1 9225 AW2D5862 AI683338 AB58509 AWZ76^ 
AA969759 N75628 K22388 H84729 H60052T^ A»22u^ 

RJ0531 H83488 W371 81 W788Q2 R66056 AW02839 R67840 AA30Q207 AW95958 1 T63226 F04005 
AA976503 AB17802 AA953664 AA404613 AA428771 BE280542 AW194691 AI927301 AI740458 AI796100 AI935603 AW052210 
AA970201 AI633384 AM25910 AHM7004 AI241295 AA402816 AA291468 
AA398838AA435847 

AM16568 AA442889 AA417233 AA442223 
AA453641AA454061 



AA157811AA836869 



122188 275673.1 
121581 283769.1 
122618 305217.1 
109026 150431.1 
123658 goibanK_AA609384 
123811 genbanlLAA620586 AA62D5B6 
125115 genbankjre7341 T97341 
125147 hK3TJ=OUND - entraz.W38150 
118737 382979.1 AA199688 N73861 
120274 genbanJLAA177051 AA177051 
113196 genbaro\J57317 T57317 
120504 genbanKJW256837 AA256837 
120639 genban)LAA286942 



W38150 
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120809 ge*banKJ\A346495 AA346495 
113702 genbank_T97307 T97307 

129680 23162J U03749 NMJ01275 J03483 J03915 AI214509 AW245744 AL046455 AA318960 AI741505 AA843875 A1829382 AI560122 
AI858999 D55958 A16840Q5 D53170 AA854091 AKJ25609 D531 19 D54729 D55504 D55377 D55313 AW512244 AA846441 
5 AW043898 Ai969102 AA4G5741 AKBi 983 AA788784 AA7D6586 AA854361 AW470949 AA843095 AA772028 AI148432 

AIQ38109 AA782478 AA910064 AI22Q384 AA781296 AA843881 AA854064 AA843125 AAB43419 AA319Q36 AA319054 
AI273831 W32275AI584185 C05724 AA789023 AI686818 D54392 AI022485AA431410 AA854232 W39212 W15214AA894441 
AI803081 AI167381 AW245389 AA319430 AA335156 A1D42646 AA327030 AA725170 T27943 AA889304 AA976699 AJ687001 
A1821 107 A1865540 AA7721 07 C05286 AA319661 AA405992 
10 101045 enlraU05614JQ5614 

117247 genbank N21Q32 

110501 genbank H55748 

103392 ertraLX94563X94563 

105032 genbank AA127818 
15 119513 NOrj = 0UND_entrez W37933 

105445 genbank AA252395 

121514 genbank AM12112 

121558 genbank AA412497 

121911 genbank AA427950 
20 123315 714071J AA496369AA496646 

114911 genbank AA236672 

409487 1134778 1 H19886 AW4O2B06T10231 
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TABLE 5: Figure 5 from BRCA 001 US 
Table 5 shows genes upregulated in tumor tissue compared to normal breast tissue. 

5 

Pkey: Unique Eos piobeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 
UnjgenelD: Un^ene number 
10 UnJgeneTKte: Unfeene gene tffle 

R1: Ratio of tumor to noima! breast fissue 



Pkey ExAccn UnlgeneD UnlgeneTWe R1 

100114 X02308 HsJ2962 fiiymWybte synthetase 23 

100147 D13666 Hs.136348 osteoblast spec^ factor 2 (fascicSn 73 

100154 H60720 K&81832 WAA0101 gene product 9.2 

100335 AW247529 Hs.6793 plateteVacfivating factor acetyfoydrola 2.7 

20 100666 L05424 Hs.169610 <»44 an^ (hanhg furicta 5.7 

100667 L05424 Hs.169610 CD44 antigen (homing function and IrxSan 9 

100668 L05424 Hs.169610 CD44 anfigen (honing funcfion and Indian 7.6 
100678 AW5Q2935 Hs.740 PTK2 protefn tyrosine kinasa 2 5&2 
100988 AKD00405 Hs.76480 ubiquMce4 11.4 

25 101031 J05C70 Hs.151738 matfefnetafloproiefoa»9(ge^ 82 

101045 J05614 gb^umanpro^erathgcelnocfearanS 5 

101332 J04088 Hs.156346 topoisornerase (DNA) II alpha (170kD) 3.4 

101352 AM94299 Hs.16297 CQX17 (yeasf) hwnotog, i^Wuwnecoxid 63 

101580 NM_012151Hs^3363 coagulation factor Vllkjssodated (Intr 5.7 

30 101582 AF064853 Hs31299 c^iantonudeotobtodhg protein ( 5.6 

101767 M81057 Hs.180884 carboxypeptkiase B1 (fissue) 14.4 

101806 AA588894 Hs.112408 SlOOcatehim^s^ protein A7(psaias 85 

101810 NM.000318KS.180612 peroxisomal membrane protein 3 (35kD,Ze 32 

101983 AI904232 Hs.75323 prohib&) 6.4 

35 102107 BE2586Q2 Hs.182366 heat shock protein 75 1.4 

102165 BE313280 Hs.159627 death associated protein 3 4.6 

102198 AW950852 HsJ4598 polymerase (DNA Directed), delta 2, rogu 43 

102217 AA829978 Hs301613 JTVIgene 6.7 

102220 U24389 Hs35436 lysosomal 43 

40 102302 AA30S342 Hs39171 proteh Idnase Cte 2 2.7 

102348 U37519 Hs37539 8loehydBO^hy*oc£riase3fenfy,n^^ 2 

1Q2374 U33635 Ks.90572 IWprotefo tyrosine Idnase 7 62 

102455 U48705 KsJ5562 (fiscoidlncfcfnah receptor fan^^ 63 

' 102568 W81489 H&223Q25 RAB31. member RASaicop^faniBy 53 

45 102618 AL037672 Hs31071 eDdiacslutarnr^protekil 53 

102687 NML007019Hs330Q2 ubkjuiincanierprotenEZC 43 

102689 U96132 Hs.171280 tydracyacyWtoenzyrra^ 6 

102704 AU077058 Hs34089 BRCA1 associated RING domain 1 13 

102705 T97490 Hs30002 SfralMjctbtecytokfansubfa^ 23 
50 102801 BE252241 Ks38041 pyrfdoxal (pyrktodne, vftamln B8) ldnas 6.4 

102827 BE244588 Hs3456 chapen^com^n^TCP1,siJbunft2(b 5.6 

103060 NiyLQ0S40rfe.155324 matrix nietaltaproteriase 11 (MMP11; stro 43 

103080 AU077231 Hs32932 cycQn D1 (PRAD1: parativyrokJ adenomatos 3.1 

103178 AA205475 H&275865 ribosornal protein S18 93 

55 103206 X72755 Hs.77367 nxrwktnelnotaedbygan^tntB^^ 83 

103238 AS3692B5 HsJ5189 o^attKesoclated protefn 5.6 

103547 AI376722 Hs.180062 prdeasorne (prosome, maoopaty 9.7 

103549 BEZ70465 Hs.78793 protem Idnase Czeta 73 

103886 AKD01278 Hs.105737 hypc^ieQcaI|HD^mi0416sfmQarto 63 

60 104325 BE379766 Hs.150675 pclynierase(RNA)ll(tmcfin^po^ 63 

104827 AWO52006 Hs3551 PRP48TK/WD splicing factor 103 

104846 A1250789 Hs32478 ESTs 5.6 

104854 AA041276 Hs.154729 3-phosphotoostocfepeiK^ protein Hn 123 

104867 AA278898 rtew2S979 hypodie^proa^sM^tosinaOG 2 
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104896 AW015318 H&23165 
104909 AW408164 Hs2491B4 
104916 AW958157 Hs.155489 
104919 AA026880 H&252S2 
104974 Y12QS9 HsJ*78675 
I04978 AI199268 K&19322 
105012 AF098158 H&9329 
I05039 AA907305 Hs36475 
IO5079 AA151342 Hs.12677 
105068 H58S89 K&35156 
I05393 AF167570 Hs256563 
IK547 AA262640 H&27445 
I06G64 BE616634 Hs288042 
(05658 AA985190 Hs^46875 
I05746 AW151952 Hs.46679 
I05858 AF151066 H&281428 
I05930 AF016371 Hs3880 
I06094 AA533491 Hs£3317 
I06350 AKD01404 Hs.194698 
106359 AW390282 Hs31130 
106610 AA458682 Hs.79732 
[06624 NM.003595H&26350 
[06713 BE614802 Hs.184352 
106829 AW959893 Hs27099 
[06846 AB037744 Hs34892 
106873 N4S809 Hs.11197 
I06973 BE156256 Hs.11923 
I07029 AF264750 H&288971 
107197 W15477 Hs.64639 
107859 AW732573 Hs.47584 
[07901 L42612 Hs335952 
107922 BE1S3855 H&61460 
107974 AW956103 Hs51712 
108274 AF129535 Hs272Q27 
I08647 BB4G947 Hs.44276 
I08695 ABQ29000 Hs70823 
I08894 AKD01431 Hs5105 
109011 AA156542 Hs72127 
109068 AA164293 Hs72545 
109273 AA375752 H&82719 
I09468 NM_015310Hs£763 
110240 AK68594 Hs.176588 
110330 AI288666 Hs.16621 
110501 H55748 
110742 AW180338 H&28029 
110762 BE044245 H&30011 
110856 AA992380 
110958 NM_005864Hsl24587 
11125 N63823 H&2G9115 
111179 AKD00136 Hs.10760 
11239 K9Q956 Hs.17230 
J1285 AA778711 Hs.4310 
11392 W46342 H&325081 
11937 BE298665 H&14846 
112244 AB029000 Hs.70823 
1129% AA737033 Hs.7155 
13777 K266947 Hs.10590 
.13791 AB69096 Hs.135578 
113811 BE207480 Hs5994 
13834 726483 Hs5059 
13868 W579Q2 H&90744 
,13870 AL079314 Hs.16537 
113923 AW953484 K&3849 
14275 AW515443 Hs306117 
14895 AA236177 Hs76591 
114965 AI733881 Hs.72472 
115061 AI751438 Hs.41271 



ESTs 

r/anscripflon factor 19 (SCI) 
NS1-assodatedprotefci1 
prolactin receptor 
bronvxtomakK&r&ining 4 



17.7 
5 

17 
1.4 
1.4 



Homo sapiens, Slnte to RIKEN cDNA 2010 72 

chromosome 20 open reading frame 1 13 

ESTs 25 

CGU47 protein 95 
HaiwsapteracDNAFU11027fis»donea 22 

interleuHn enhancer Wwflng factor3,9 5.4 

unknown 9-3 

hypcfteBcal protein FU14299 14 

hypothetical protein FU20059 9.4 

rTypoS»etxalprotBinRJ20739 15 

hypothetical protein 25 

pep^pro^bonieraseH(cycloph^ 52 

rrypofiis6calprotekiRJ14681 63 

cycfinB2 5.7 

bansmsrnbian87superfan^rne^ 63 

flw&il _ 73 

tyrosytprofein sulfoiransferase 2 7.7 

hypc^kalprotefriFU12549 45 

hypo8ietkdprr^RJ23293^n^lD 16J2 

WAA1323 protein 22 
Homo sapiens, done IMA(^3343149, mRNA, 163 



hypothetical protein 
myeloid/lymphoid or rrte*flneageteutem 
gBoma pajhogenesiHelated protein 



6.6 
13 
6.1 
8.4 

23 

Igsuperf^receptorLKIR 22 
pynjvate dehydrogenase ldnas8,tso^ 6.7 
F^aity protein 5 7.1 
horneobootCIO 93 
WAA1077 protein 72 
hypofliefcai protein RJ10569 4 
ESTs 1.4 
ESTs 23 
Homo sapfens mRNA; cDNA DKFZp586F1 822 <f 23 
WAA0942 protein $2 
ESTs,WeaWysirnaartoCP4YJ1UMANCYTOC A2 
DKFZP434J11 6 protein 6.2 
gb9c^4a0l4lSoaresteta)0verspleen 6.1 
hypo9ietir^poteinMGC11256 7.6 
hypotfieik^ protein MGC2963 93 
gfcot37g06.s1 SoaresJesfisJWT Homo sap 23 
signal fcansA)dtaipi©teh{SrOccfltah 67 
ESTs,MotefalBlysirn8arto21»JWMANZ 33 
asporih(LRRdass1) 7.1 
hypc4ieticalprotetof : U22Q87 73 
eukaryotic translation lUUdUuk factor 63 
Homosaprais,ctoretMAGE^9^mRNA t 8.4 
Homo sapiens mRNA; cDNA DKFZp564D016 (fr 105 
WAA1077 protein 145 
ESTs,Mo(teratelysiinflarto2115357ATYK 53 
zinc fir^ protein 313 13.4 
chrtobiBSB) (B N ' O celyl" 13 
Hcfltosar^d^RJ22044tts > doneH 3.1 
EGF-contelnl^ffi>uBr^ 113 
pn^easc^(prc5cmrracn^)26Ssubu 2.7 
hypome^pote^sini^top^ 6.1 
hypoe)e8calpn^aJ22M1simferb 13 
WAAD3C8 protein 153 
protein 7.1 
BNP-R1B 23 
Homo sapiens mRNA full lengft insert cDN 113 
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16011 
16417 
16470 
16637 
17132 
17881 



19403 
19789 

120206 
120253 
20297 
20325 
20327 
20349 
20356 
120371 
20383 
120386 



AK0Q2163 
BE545072 



AF231Q23 

AI867451 

AB037753 

AL359053 

AW499664 

A1272141 

AKD01043 



9075 



20396 
20418 
20472 
20484 
20570 
20582 
20596 
20624 
206% 
20713 
20750 
20774 
20807 
20809 
120984 
121081 
121408 
121505 
121508 
121513 
121549 
121558 
21655 
21744 
121748 
121773 
121832 
121839 
121882 
121911 
21999 
122013 
122036 
122356 
22371 
22372 
22460 
22490 



AF161470 

AI949952 

M10905 

BE539706 

T65004 

AL117554 

BE393948 

H26735 

M131376 

AA191384 

AA195651 

AK0OQ2S2 

AW969481 

AF000545 

AA219305 

AL109963 

AW969665 

AW967985 

AA134006 



AI950087 

AA253170 

AA280679 

BE244830 

AA282074 

AW407987 

AA976503 

AW449855 

AH91410 

A1808909 

AA346385 

AA346495 

BE262951 

AA398721 

AA406137 

AA494172 

AA402515 

AA416653 

AM12477 

AM12497 

AA421537 

AA398784 

BE536911 

AB033Q22 

AW340797 

AA425691 

AM26376 

AA427950 

AM3Q211 

AA431085 

W92142 

AA443794 

AA868555 

AA4460O8 

AW41878B 

AA448349 



H&301724 

Hs.122579 

Kfe^8178 

Hs35173 

Hs.46679 

Hs.62767 

^57664 

Hs.12484 

Hs33484 

H&32K3 

Hs.42315 

Hs^G0622 

Hs.49397 

Hs^87820 

Hs£85363 

Hs.163561 

Hs.119908 

Hs3Q915 

Hs31668 

Hs326401 

Hs.104072 

Hs. 104106 

H&27S732 

HSv55169 

H&296433 

Hs.104196 

Hs.123122 

Hs.154848 

Hs325572 

Hs.79306 

H&26613 

Hs36473 

Hs271445 

Hs^B4228 

H&237323 

Hs/173518 

HSJ96557 
Hs36693 
Hs.193985 
Hs30002 

H&99052 

Hs.186749 

H&98019 

Hs.194417 

K&97887 

H3.181510 

Hs38142 

Hs.178072 

H&97514 

H&234545 

Ks.158854 

Hs38434 

Hs.191606 

Hs.98459 



Hs387Q8 

H&2719S3 

Hs38390 

Hs.178222 

Hs336677 

Hs.99148 

Hst238151 



rrypofoefcal proteii FU11301 
hypothetical protein FU 10461 
hypofoefcalprotetoFU23468 
cadherHEGFlAGsevovpassG-typerece 
hypotiefical proteh RJ20739 
WAA1332proteki 

Homo sapfens mRNA Ml lengm insert cDN 

Human done 23826 mRNA sequence 

SRY(sexdeteJinWrigreg!OTY)-box4 

iniBgn\>Med)dnase^ssodateds8nno 

pir>b*Kflng protein 

butyiafe-fetduced transcript 1 

ESTs 



15 
6.2 
106 
63 
53 
9.8 
2.4 
7.4 
2.1 
2.7 
52 
5.7 
7.4 
5.7 

ESTs 1.4 
ESTs 8.4 
micfeo)vjMelftNOP5/NOP58 6.7 
fcaftrein 5 (KIX5; KLK-tf; stratum com 9.2 
Homo sapiens done PP1498 unknown mRNA45.7 
fibroblast grorti factor 12B 383 
ESTs, WeaWy strnfer to Z195 JWMAN ZINC 15.2 
ESTs 6.4 
r^xtfiefcal protein FU2Q285 16.1 
hypoftefcal protein 163 
putative ptrinerojc receptor 28.1 
EST 12.4 
FSH prtmary response (LRPR1, rat) homofo 9.7 
typofteMpn^DKFZp434D0127 32.6 
ESTs, Mo aenftft ^stm terto ALJU7_HUMAN A 21.7 
eukaryoSc tnsislaDon hHfcaflw) factor 123 
Homo sapiens mRNA; cDNA DKFZp586F1323 (f 11.4 
gbn«j05c02jd NCLCGAPJOd12 Homo sapfen 19.4 
EST 10.4 
ESTs, WeaWf sbnflarto ALU1JWMAN ALU S 14.4 
ZNF135^te protein 105 
N-ace^ft^ucosamfae^PSphatBrnutese 73 
M-phase pnospnoproteih homotog 52 
gtxoq30a04^1 NCLCGAPJ3C4 Homo sapiens 408 
Homo sa}tocDNAFU12727^ done NTT 53 
ESTs, Moderately ^mflarto 21 092K3A Be 7 
ESTs 73 
SH^rrtaWng protein SH36LB2; KIAA1848 63 
gbeEST52657 Fetal heart II Homo sapiens 4.4 
ESTs 5.6 
ESTs, Hghly stmBarto 07550 mismatch 5.4 
EST 6 
ESTs 13.1 
ESTs 28 
ESTs 6J2 
EST 7.4 
gbzB5g123l SoaresJesfisJiHT Homo sap 23 
Homo sapiens mRNA; cDNA DKFZp434B1023 (f 73 
ESTs 7.1 
hypothec protein NUF2R 193 
K1AA1196 protein 73 
ESTs 53 
ESTs, Hightysbiiilar to WAA1048 protein 5 
ESTs 5 
gfcz*50©2^1 SoaresjDtaUetiJS_J&2HF8_ 72 
EST 6.4 
ESTs 63 
ESTs, WeaMy stmflartoALUSjftJMAN ALU S 13.1 
ESTs 73 
ESTs 5 
EST 73 
ESTs,WeetysMlartoS43569 R01H10.6 97 
EST 6.1 
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22432 
22510 
22530 
22572 
22607 
22614 
22616 
122618 
22622 
22717 
22829 
22838 
122856 
122868 
122907 
23016 
23034 
123136 
123152 
(23394 
23466 
123486 
23615 
23735 
23753 
24006 
24385 
24440 
24656 
24683 
24735 
24761 
24768 
24788 
24811 
24812 
24822 



24903 
24930 
24942 
25051 
25056 
25101 
25115 
25280 
27274 
28528 
(28670 
28691 
28772 
28781 
28797 
28868 
28891 
(28946 
28975 



AA44B417 Hs.104990 
AA449232 Hs.99195 
AW959741 Hs.40368 
AM52601 H&99287 
AA453518 K&98023 
AM53630 H&99339 
AA453638 Hs.161873 
M453641 

AA453987 Hs.144802 
AA456859 Hs.178358 
AW2G4530 Hs59500 
AA460584 Hs334386 
AI929374 Hs.75367 
AP005216 Hs.115541 
AA470074 Hs.169896 
AW338067 Hs 373731 
AL359571 Hs.44054 
AW451999 Hs.194024 
AW601773 Hs^70259 
AA731404 Hs.105510 
AA599042 Hs.112503 
BE019072 K&3348Q2 
AA609170 

NR/L013241HS.95231 
AA609955 Hs234961 
AI147155 H&270016 
AI267847 

AA532519 Hs.129043 
AW297702 Hs.102915 
AA381661 Hs.119878 
R22952 H&268685 
AA374756 H&93560 
AW368528 Hs.100855 
R43543 Hs.100912 
R46068 H&28B912 
R47948 Ks.186732 
AA418160 H&86043 
R65763 Hs.101477 
AW295713 H&221441 
AI076343 Hs.173939 



R99978 

T79956 

TB1310 

AI472068 

T97341 

AI123705 



29019 
29076 
129088 



29198 
29347 
29362 
29372 
29404 



H&268892 
Hs.100588 
Hs.100592 
H&286236 

Hs.106932 
AW966158 Hs58582 
R39234 H&251699 
AA975486 Hs.103441 
W27939 Hs.103834 
BE3Q2796 Hs.105097 
N71826 Hs.105465 
NMJDQ2975H3.105927 
AA419006 Hs.106730 
F34856 H&292457 
Y13153 Hs.107318 
BE560779 H&2B4233 
AI816224 Hs.107747 
AI950087 

AW296806 H&326234 
AA744610 Ks.194431 
AA463189 Hs288906 
N57532 Hs.109315 
BB614192 Hs279869 
U3Q246 Hs.110736 
NMJD16039HS.110803 
A1267700 Hs317584 



ESTs 5.4 
ESTs 112 
adaptor-related protein complex 1, sigma 10.1 
EST 11 
ESTs 615 
EST 10J 
ESTs 107.3 
ghzx48eQ6.s1 Scares Jestis_>IHT Homo sap 31.1 
ESTs 55 
ESTs 85 
ESTs 815 
ESTs 753 
SrcJDQB-adapter 55 
Janus kinase 2 (a protehtyrosme Unas 53 
ESTs 115 
HcxnosapurocDHAmi1946fts,dcinehE 25 
ninein (GSK3B Weracfing protein) 8.7 
ESTs 5.1 
ESTs 5.2 
ESTs 3.6 
EST 7.4 
HornosapIeflScO}^FU14680ns,dDneriT 2.4 
gb3«2a1Zs1 Soares_tesfis_NWT Homo sap 75 
FH1/FH2 ctoakKXtttaning protem 10 
Hunfingtin fcnteracfing protein E 9)5 
ESTs 8.1 
gfcao49a10jc1 Stanley Frontal NB pool 2 57.1 
Human DNA sequence tram clone 989H11 on 7.6 
ESTs 83 
ESTs. Weakly antoto M3K9_HUMAN MITOG 7.9 
ESTs ' 113 
Homo sapiens mRNA for K1AA1 771 protein, 9 
ESTs 8.1 
Homo sapiens cDNA: RJ22726 fts, done H 5.1 
hypofl^efeaJ protein FU22604 142 
ESTs 75 
Homo sapiens cDNA FU13558 fts, done PL 6.6 
EST 235 
ESTs 32.4 
ESTs, WeaMys^toAUJBJttJMAN till 225 
ESTs, Moderately sirrrfbr to B34087 hypot 6.1 
EST 1353 
ESTs 5.4 
K1AA1856 protein 55 
gteye57e05.s1 Soares fetal Ever spleen 9.6 
ESTs 8 
Homo sapiens cDNAFU12789fis. clone NT 125 
ESTs, WeaWy sJmBar to DN4-GG7K14 |H.s 25 
Hctno salens, Shiitar to RlKB^cDNA 1700 7.1 
hypo^eo^ protein MGC5576 7.7 
tyrrfoTnekiriaselsotubte 53 
small nuclear rfboratdecprotein poiypept -535 
stemc^grcwmfedor.rymphoc^ 133 
chi or noscTO22openiBad^frame3 3 
Homo sapiens, done MGC:16362, mRNA. com 133 
kyruirenme 3-mwx>caygenase 72 
MCE* proton 14 
DKFZP566C243proteb 15 
gtxwq05c02j(1 NCLC6APJQoi2Horoosapten 25 
ESTs,H|gtUyslrrtotoT46422r9pofiieB 5 

17.1 
205 

53 

rnelanorrahassociated antigen recognised b 75 



WW Dc^rtam-ContaWng Gene 
WAA1415 protein 



sctocarn^{arrdly12(sc<iturn/potassl 6.7 

CGW9 protein 2 

ESTs 5 
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129482 
129559 
129587 
12983 
129649 
129680 
129689 
129702 
129720 
130010 
130097 
130135 
130211 
130242 
130359 
130365 
130448 
130455 
130471 
130503 
130511 
130542 
130553 
130556 
130567 
130574 
130617 
130667 
130693 
130744 
130757 
130880 
130944 
131046 
131060 
131099 
131135 
131185 
131225 
131245 
131283 
131569 
131643 
131714 
131722 
131760 
131793 
131885 
131900 
131905 
131941 
131971 
132180 
132203 
132273 
132288 
132294 
132348 
132370 
132384 
132450 
132465 
132532 
132574 
65 132638 
132718 
132726 
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AA188185 Hs^89043 
W01296 Hs.11360 
H14718 Hs.11506 
AKD0D398 Hs.11747 
AD000092 Hs.16488 
U03749 

AW48482 Hs.77873 
A1304966 Hs.12035 
AA156214 Hs.12152 
AA301116 Hs.142838 
ALD46962 Hs.14845 
AA311426 H&21635 
NM_D03358Hs^3703 
X79201 Hs.153221 
NM.013449HS277401 
W56119 Hs.155103 
BE513202 Hs.15589 
D90041 Hs.155956 
AL121438 Hs.183706 
BE208491 Hs395112 
L32137 Hs.1584 
U64675 Hs.179625 
AFD62649 Hs252587 
AJ907O18 Hs.15977 
AA383092 Hs.1608 
AF083208 Hs.16178 
M90516 Hs.1674 
BE246961 Hs.17639 



hypofoeEcal protein FU 14784 
Human clone 23589 mRNA sequence 
hypofiietk^ protein FU20391 
cafceflctrffn 



67 
75 
63 
3.8 
3.3 



Hs.17962 
Hs.18747 
AL036067 Hs.18925 
BE514434 H&20830 
BE382657 Ha214% 
AA321649 Hsl2248 
AA194422 H&22564 
AL133353 H&226581 
KM-016569Hs267182 
BE280074 H&23960 
H62087 H&31659 
AL080O8O H&24766 
X80038 Hs339713 
AL389951 Hs271623 
AW410601 H9L30Q26 
AA642831 Hs31016 
W3757 Hs311 
X76732 Hs3164 
AW966127 Hs32246 
BE502341 Hs3402 
AA099014 rfe231Q29 
AA179298 Hs3439 
BE252983 Hs35G86 
BB67100 Hs.154938 
NMJD04460HS.418 
NM.004782HS.194714 
AA227710 Hs.43658 
N36110 Hs305971 
ABQ23191 Hs.44131 
AW067708 H&170311 
AW572805 Hs.46645 
AA312135 Hs.46867 
AA100012 Hs.48827 
AW169847 Hs.49169 
AA454132 H&5080 
AW631437 Hs.5184 
AI796870 Hs34277 
Nhi.004600Hs£54 
N52298 H&55608 



gbHuman dirornogranm A (CHGA) gene, pio 14.1 

B7hcmoJog3 Z6 

ESTs, Weakly stanBar to 138022 nypotnefi 7.4 

APMCFtproteh 2 

nucleolar phosphoprotem Nopp34 13 

forkteadbox03A 23 

tubulin, gamma 1 6.1 
ESTs, Moderate)/ sWter to CEGTJflJMAN C1.6 

synovial sarcoma, translocated to X chro 5.4 

brcmoo\)niatodc>ceirttoteflrtp^a^ 85 

euksryofic translation initialion factor 11 

PPARbWkTgproteJn 33 

ri^cetytarsfera^ 335 

adducail (alpha) 2.7 

K1AA0618 gene product 16.1 

ca^ageofigomerfernatrix protein (pse 6.1 

RAM binding protein 2-Bce 1 73 

pftulaiy tuinor-transfonnlng 1 14.4 

EmpHcaBy selected from AFFXsingtepr 4.7 

repBcatkm protein A3 (14kD) 73 

apoptosb antagonizing trar p«fr*Bon fee \2 

glutanBrte4njctose-6-phosphate transanun 12.1 

Homo sapiens ubkjuifin protein Bgase(U 13J9 

ESTs 2 

POP7 (processing oJ precursor. S. cerevi 31 

proteffixOOOl 5.7 

kkiesbvJike2 2.1 

signal transducer and activator of trans 5.4 

smaQb)c^^ cytokine sub^mSyB(Cy 7.4 

myedn VI 5.1 

COX15(yeas$lx>iric^,cytocta 7 

TBX34soproteh 33 

cycfiiBI 53 

thyroid hormone receptor-associated prot 73 

thtoredoorin domain-^onlaWng 23 
r^sapi^ done F19374APOE-C2 gene 13 

nudeopofin 50kD 5 

HSPC182protem 23 

putafreDNAbira&tg protein 23 

prxsphoribosyl pyrophosphate anudotransf 3.4 

mmfeoMhdh 2 23 
Hon} sapjertscDNAFU14^ f^done NT 73 

ESTs 137 
Homo sapiens, clone MGCrt5961, mRNA, com 87 

stomafin-Bke 2 113 

ublqurBo specific protease 1 23 

hypoftieticalprotetoA«)S025 33 

fibroblast atirvationptoteh, alpha 147 

synaptosornal-assodated protein, 29kD 73 

DKFZP586L151proteta 10 

sdutecanierfarnBy2(raaSatedglu 9.2 

KIAA0974proteto 2 

heterogeneiwruclearntiomAieopnT^ 125 

ESTs 283 

HSPC034protem 6.1 

hyrjo&ieScalprotetoFU12085 83 

KIAA1634 protein 6.1 

rnttoctoixirial ribosornal protein L16 7.1 

THIdrosqifiitetarnotog 14 
I^sec^^()ncrmjrnoswT)eX(unkpje)®212.4 

SJogn3nsynAaT«ant^A2(60)0,nl)on 3.7 

hypofoeraproteinMGC955 143 

145 
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132731 AI1B9075 Hs3Q1872 
1 32744 AA010233 HS55921 
I32773 AA459713 Hsl295901 
I32798 AKQ26701 Hs5716 
132810 AB007944 Hs5737 
J32B33 U78525 Hs57783 
1 32842 NMJM6154HS579771 
132851 U0Q716 rfc287912 
132891 BE267143 H&59271 
132941 A1817165 Hs.6120 
1 32972 AA03436S rfe288924 
I32980 AA040696 H&62016 
[32994 AA1 12748 H&2799Q5 
1 33016 AM396B8 Hs5289 
133177 X97795 Hs.66718 
133208 AJ801777 Hs5774 
I33254 AI557421 Hs273330 
133268 AI160873 H&69233 
I33268 AW95G781 Ha293937 
1 33285 M76477 H&289082 

133390 A1950382 Hs.72660 

133391 AW103364 Hs.727 

1 33540 AL037159 Hs.74619 
133594 AW160781 Hs.172589 
133621 M/L004893HS.75258 
133720 L27841 Hs.75737 
I33750 BEZ71766 Hs.181357 
[33784 BB22743 H&301064 
[33791 M34338 Hs.76244 
133797 AU 33921 Hs.76272 
133822 D50525 Hs599 
133850 W29092 Hs.7678 
133865 AB011155 Hs.170290 
133881 U30872 Hs.77204 
[33924 086326 Hs325948 
I33959 X81789 Hs.77897 
133989 AL040328 HsJ82Q2 
133997 AJ824113 Hs78281 
134234 BE300078 Hs30449 
134348 AW291946 Hs32065 
134376 X06560 Hs32396 
134379 AW362124 H&323193 
134405 AW0679Q3 Hs32772 
134421 AU077196 Hs32985 
I34460 NMjOQ5000Hs53916 
134516 AKD01571 H&273357 
I34529 AW411479 Hs548 
134751 AW63D803 Hs59497 
134790 BB002798 H&287850 
134806 AD001528 Hs59718 
I34850 AI701162 Hs50207 
I34859 D2S488 Hs5Q315 
134971 A1097346 H&286049 
135181 BE250865 H&279529 
I3S207 N26427 Hs5634 
135245 ATO28767 Hs362603 
135257 AW291023 Hs57255 
135307 A1743770 Hs58368 
135321 AI652069 Hs5B614 
35354 AA456454 Hs.183418 
135400 X78592 K&S9915 
302Z76 AW057736 Hs323910 
317781 NMJD07057H5.42650 
321114 AA9Q2256 H&78979 
322556 BB041451 Hs.177507 
420802 U22376 Hs.1334 
424001 W67883 Hs.137476 



55 

6.4 

t45 

25 

42 

6.1 



hypoMcal protEin MGC4840 
gJutannytprolyMRNA synthetase 
K1AA0493 potato 
K1AA0310 gene product 
WAA0475 gena product 
euteryofic translation taiBafion factor 
Homo sapiens clone PP1596 unknown rnRNA 7.1 

beta, manmse^fxfing, 1 6-1 

U2(R^)smalnwiearRrMauxBaiyfe 27 

hypoff^calprotehaJ13222 2.1 

Homo sapiens (DNAFU11392 fis, clone HE 35 

ESTs U 

GtoneHQD310PRO0310p1 17.1 

hypoMfcal prated RJ20886 4.4 

RAD54(S.cerevsiae>&e 4.4 

Homo sapiens, clone IMAGE3544662, rnRNA. 13 
zinc finger prcrteto 10.1 
ESTs,Weak^sh»BartoFXD2JttmiFORKH 122 
GM2gangBoskteac8vatDrprotBJn 10.4 
phcephaOdyfeerine receptor 5.7 
lnhflbbi,befeA(acfivfriA,8cevinABa 255 
p}Oteason^(p<osome,n^cropalf))26Ssubu 17 
nudearpr«sphoprotelnsimBartDS.cef 18 
H2Ahlstor»fam^,rnernberY 135 
pericentn^ material 1 6.7 
lamlnto receptor 1 {67WD, ribosomal prot 5.4 
arfapftil 12 - 1 
spermine synthase 9.7 
n^oU3stoma-bvKlbqpn3teln2 1.3 
pep8o>tprory1isorneraseB(cy^^ 9.7 
ceflu^rett^3ckU)hdIng protein 1 42 
discs, large (Drosophita) homoSog 5 5 
centrwrw protein F (350M00IO. mftosln 9.1 
veskfetocting protein pi 15 1.8 
spiting farior3a. subun&3,60H> 10.4 
SWUShP related, n^assocated,3ca 2.6 
regulatDrofG-protE^s^jiaflrig12 13 
r^sap^ctonelMAGE3535294,rnRNA. 103 
MeneuHn 6 signal transducer (gp130, 
2\S-dfocao^nylafe synthetase 1 (4046 
nypofhe&calproteh MGC3222 
cotegen, type Xl r alpha 1 
coBagen, type V, alpha 2 
BiipkK^selectBdfnxnAFFXsmgJepr 
hypoftefical protein FU10709 
FK50WAKDngrjrctek»4(59kD) 
bfrihBI 

integial manbrane proteto 1 
spermine synthase 
hypc>&teteIprotehMGC11138 
WAA00Q7 protein 
prwsphosenne a njn u lransf er ase 
px19~6)(B protein 

ESTs,HfcMystiiiIIartoC10J^^ 1.7 
ESTs 

ESTs, WeaWy similar to A46010 Xtoted 
ESTs,WeaWyshiBartoKIAA0822prrteln 
rfceeorre binding protein 1 (dog 180kD ho 
ceB oTwsfon cycle 2-fflte 1 (RTSLRE pr 
androgen receptor (dtiydrotestosterone r 
HER2 receptor tyrosine kinase (c 
ZWIObrteractor 
GoJgl apparatus protein 1 
hypothetical protein 

v*nyb avian myeloblastosis viral oncogen 
patemaly expressed 10 (PEG10; K1AA105 
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425182 AF041259.HS.155040 zinc finger proteh 217 23 

446999 AA151520 Ha334822 hypoftetfcalpfDteinMGC4485 7.5 

450701 H39960 t&288467 Homo sapiens cDNA FU12280 fis, done MA 5.6 

452461 N78223 Hs.108106 transcription factor 4.7 

453157 AF077038 H931989 DKFZP586G1722 protein 12.1 
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TABLE 5 A 

Table 5A shows the accession numbers for those pkeys lacking unigenelD's for Table 5. For 
each probeset, we have listed the gene cluster number from which the oligonucleotides were 
5 designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" column. 

10 Ptey: LWqueEo3 probes^ Wenfifier number 

CAT number Gene duster number 
Accession: Genbank accession numbers 



15 



CAT number Accessions 



35 



40 

122618 
125115 
120809 
45 129680 



50 

101045 
110501 
121558 
55 121911 



19346.14 
44573J2 



123615 
124385 
110856 
20 120472 



25 



30 129019 44573_2 



30686 -15 AA609170 
656394J A1267847N27351 

AA992380 N33063 N21418 H79958 R21911 H79957 
AJ950087 N702MR97040N36803AI3^ 

AA908598 AA251875 AI820501 AB2D532 W87891 T85904 U71456TB2391 BE328571 T75102 R34725 AA884922 BE328517 
AG19788 AA884444 N92578 F13493 AA927794 AI56Q251 AW874068 AL134043 AW235363 AA663345 AWD08282 
AA488964 AA283144 AI890387 AI950344 AI741346 AI689062 AA282915 AW102898 AB72193 AI763273 AW173586 
AW150329 AJ653832 AI762688 AA388777 AA488892 AJ356394 AW103813 AE39642 AA642789 AA856975 AW505512 
AI961530 AW629970 BB612881 AW276997 AW513601 AW512843 AA044209 AWB56538 AA180009 AA337499 AW961 101 
AA251669 AA251874 A1819225 AW205862 AI683338 AIB58509 AW276905 AIS33006 AA972584 AA908741 AW072629 
AV«13996AA2932nAA969759N75628N22388H84729H60052T92W 

AW613456 AB73032 A1564269 F00531 H83488 W37181 W78802 R66056 AIC02839 R67840 AA300207 AW959581 T63226 
F04005 

AJ950087 N7Q208 R97040 N36809 AI308119 AW967677 N35320 AB51473 H59397 AW971573 RS7278 W01059 AW967671 
AA903598 AA251875 AJ820501 AI820532 W87891 T85904 U71456 TB2391 BE328571 T75102 R34725 AA884922 BE32B517 
AI219788 AA884444 N92578 F13493 AA927794 AI5GQ2S1 AW874068 AL134O43AW235363AA663345AW008282 
AA488964 AA283144 AI890387 A1950344 AI741346 AI689062 AA282915 AW1Q2898 A1872193 AI763273 AW173586 
AW150329 AI653832 AI762688 AA988777 AA488892 A1356394 AW103313 A1539642 AA642789 AA856975 AW505512 
AB61530 AW629970BB612881 AW276997 AW513601 AW512843 AA044209 AW856538 AA180009 AA3374S9 AW961 101 
AA251669 AA251874 A1819225 AW205862 A1683338 AJ858509 AW276905 AI633006 AA972584 AA908741 AW072629 
AW513996 AA293273 AA969759 N75628 N22388 H84729 H6QQ52 T92487AW22058AA780419AA551005W80TO1 
AW613456 A073O32 AI564269 F00531 H83488 W37181 W788Q2 R66056 AKW2839 R67840 AA30Q207 AW959581 T63226 
FO4O05 

120695 9683_3 AA976503 AI917802 AA953664 AA404613 AA428771 BE280542 AW194691 AI927301 AJ740458 
AJ796100 AB35603 AW052210 AA9702D1 A1633384 AA425910 AJ017004 AI241295 AA4Q2816 AA291468 
AA453641 AA454061 
genbanRJ97341 T97341 
genban*JW346495 AA346495 

2316^1 U03749 NMJJ01275 J03483 J03915 A1214509 AW245744 AL046455 AA318960 AI741505 AA843875 AI823382 A860122 
AI858999 D55956 AI684005 D53170 AA854091 AHE5609 D53119 D54729 D55504 D55377 D55313 AW512244 AA846441 
AWD43898 A1969102 AA405741 AJ091983 AA788784 AA706586 AA854361 AW470949 AA843095 AA772028 AI148432 
AI038109 AA782478 AA910064 AG2D384 AA781296 AA843881 AA854064 AA843125 AA843419 AA319036 AA319054 
AI273831 W32275 AB84185 CG5724 AA789023 AI686818 D54392 AI022485 AA431410 AA854232 W39212 W15214 
AA894441 A1803081 AI167381 AW245389 AA319430 AA335156 AI042546 AA327030 AA725170 T27943 AA889304 
AA976699 AI687001 AJ621 107 AB65540 AA772107 C06286 AA319661 AA405992 

entrc?J05614J056t4 

genbanLH55748 H55748 

genban)UVM12497 AA412497 

genbanXJW27950 AA427950 
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TABLE 6: Figure 6 from BRCA 001 US 
Table 6 shows genes upregulated in tumor tissue compared to normal breast tissue. 

5 

Ptey: Unique Eos probeset Idenffifer number 

ExAccn: Exemplar Accession number, Genbank accession number 
UnfeenelD: Unigene number 
10 UnigeneTffle: Unfeenegeneffite 

R1: Ratio of tumor to normal breast tissue 



15 



Pkey ExAccn Unlgenefi) UnlgeneTtite R1 



100147 D13666 Hs.136348 osfc8obJastspecifk;fac^2(fasctdln 75 

100678 AW502935 Hs.740 FTK2 protein tyrosine kfnase 2 SU 

101806 AA586894 Ha.112408 S100caWui)^dingprc^A7(p 8.9 

102455 U487Q5 Hs.75562 dlsrokfln domain receptor famBy.member 65 

20 103206 X72755 Hs.77367 monokine induced by gamma interferon 8.8 

105743 BE2465Q2 Hs3598 sema domain, imrrajnogtobuSn domain 0g). 23 

105746 AW151952 Hs.46679 rrypoftetkd protein FU20739 13 

106373 AW503807 H&219Q7 histone acetyttransferase 13 
110240 A1668594 Hs.176588 ESTs, Weakly similar to CP4Y_HUMAN CYTOC 4.2 

25 119260 AK001724 Hs.102950 coat protein gamma-cop Z2 

120206 H26735 Hs31668 Homo sapiens done PP1498 unknown mRNA 45.7 

120253 AA131376 H&326401 fibroblast growfttactor12B 383 

120297 AA191384 Hs.104072 ESTs, Weakly sMar to Z195J1MW ZINC 152 

120624 AW407987 Hs.173518 *Hihasephi)spl»ct>fO^ 52 
30 120695 AA976503 gbxjq30a04^1 NCLCGAP_GC4 Homo sapiens 463 

120807 AA346385 Hs3Q0Q2 SH3^xmtair^ prcleh SH30B2; WAA1848 63 

121508 AA402515 Hs.97887 ESTs 28 

122607 AA453518 H&98Q23 ESTs 615 

122616 AA453638 Hs.161873 ESTs 1073 

35 122618 AA453641 gkzx48eQ6.s1 SoaresJesfe.NHT Homo sap 31.1 

122829 AW204530 Hs39500 ESTs 813 

122838 AA4605B4 Hs334386 ESTs 753 

123753 AA609955 H&234961 Hunfingtin interacfing protein E 303 

124385 AI267847 gbaq49a10jc1 Stanley Frontal NB pool 2 57.1 

40 124860 R65763 Hs.101477 EST 219 

124930 A1076343 Hs.173939 ESTs. WeaWy sMar to ALUBJiUMAN IB! 223 

125051 T79956 Hs.100588 EST 1353 

128781 W71826 Hs.105465 smaD nuclear rfcormdeoproiein pdypept 533 

129096 AA463189 Hs288906 WW DwnabCcnlaWng Gene 203 

45 129347 BE614192 Hs779869 rnelanoina^scdated antigen recognised b 73 

13689 AW748482 Hs.77873 B7homolog3 23 

130503 BE208491 te295112 KIAA0618 gene product 16.1 

130511 L32137 Hs.1584 C2fffi^eciiQ^mrk:rr^prot^(ps6 6.1 

131046 AA321649 H&2248 srriafl inducible <*tokinesu^^ 7.4 

50 131643 AW410601 Hs30026 HSPC182 proteh 2.9 

131925 AF151048 Hs.183180 artapriase prmrotirig a>r^^ 2.7 

v 132180 M1JD04460H&418 fibroblast adwaflon protein, alpha 14.7 

132370 AW572805 Hs.46645 ESTs 283 

132994 AA1 12748 K&279906 ckxie HQ0310 PRO0310p1 17.1 

55 133016 AI439688 Hs3289 hypothec prtfehFU203« 4.4 

133266 AI160873 Hs39233 zbx: firmer protein 16.1 

133391 AW103364 Hs.727 irth^ beteA (acfivtnA. acfivinABa 253 

134169 A1690916 Hs.178137 transducer of ERBB2. 1 1.2 

134219 NtiQ00402Hs30206 p^icose^ftasphatedeh^^ 13 

60 134405 AW0679Q3 Hs32772 coOagen, type XI, alpha 1 723 

134529 AW411479 Hs348 FK5064)rndIj>g protein 4 (59kD) 23 

134975 R50333 Hs32186 Leman coied-coB protein 2.6 

1351B1 BE250865 HaZ79529 px1943te protein 143 

322556 BE041451 Hs.177507 hypothefca! protein 23 
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TABLE 6A 

Table 6A shows the accession numbers for those pkeys lacking unigenelD's for Table 6. For 
each probeset, we have listed the gene cluster number from which the oligonucleotides were 
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession** column. 

Ptey. Unique Eos probesetktenffiier number 

CAT number Gene duster number 
Accession: Genbank accession numbeis 

Pkey CAT number Accessions 

AG67847NZ7351 

AA976503 A1917802 AA953664 AA404613 AA428771 BE280542 AW194691 AI927301 AI740458 AI79B100AB35603 
AW052210 AA97Q201 AI633384 AA425910 M017004 AI241295 AA4Q2B16 AA291468 
AA453641AA454061 



124385 556394J 
120G95 9683.3 

122518 305217.1 
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TABLE 7: Figure 7 from BRCA 001-1 US 

Table 7 shows genes upregulated in tumor tissue compared to normal breast tissue. Open 
reading frames in the sequences have been characterized as having a signal sequence (SS), a 
transmembrane domain (TM) or other. 



10 Ptey: Unique Eos pfotesetWenfifer number 

ExAccn: Exemplar Accession number, Genbank accession number 

UrageneO): Un&ene number 

Unkjene Tffie: Unigene gene ffite 

R1: Ratio of tumor to norma! breast tissue 

IS ORFsbuctlnfo: Stmctural crtafadertata gene 



35 



Pkey ExAccn 


UnlgeneB) 


UnJgeneTfSe 


R1 


ORF struct info 


100113 NMJ01269 


H&84746 


chromosome condensaOun 1 


23 


TM 


100/114 XQ2308 


H&82S62 


mymkfylate synthetase 


ZB 


other 


100131 D12485 


Hs.11951 


e(ionudeofidepyiDphos|4ffllase/phosphc^ 


1.9 


other 


100146 BE185499 


H&2471 


K1AA0020 gene product 


1J9 


TM 


100147 D13666 


Hs.136348 


osteoblast specific factor 2 {fasckin 


7.6 


other 


100154 H60720 


H&81892 


KIAA0101 gene product 


9.2 


other 


100163 W44671 


Hs.124 


gene predicted from cDNA wnn a complete 


1.6 


outer 


100220 AW015534 


HS217493 


annexinA2 


Z 


other 


10Q2& D38521 


Hs.112396 


.-,|„t.i 

K1AAD077 protest 


4 C 

lJ> 


other 


lUUl/1 DClDUwl 


Hs2S62SQ 


QinAmlraim_h&w4mn nmipin A11 (cidnv? 
o iw coKauiirMiiiuxitg jxuicii mi ^t*uyit. 


13.5 


other 


100275 BE242802 


Hs.154797 


KlAA0090pio1ein 


5.1 


other 


100323 D50920 


Hs*23106 


WAA0130 gene product 


1.9 


TM 


100335 AW247529 


HSv6793 


pbte&acfivafing factor acetylhydrota 


2.7 


other 


100364 NIUL004341 


Hs.154868 


caiteiroyl-phosptotd synthetase 2, aspart 


2 


other 


100372 NM.014791 


Hs.184339 


WAA0175 gene product 


2J5 


other 


100383 D84145 


H&39913 


novo) RGD-containfng protein 


32 


omer 


100400 AW954324 


Hs.75790 


phosphatidySnosSDi glycan, class C 


1.5 


other 


100418 D86978 


HaM790 


K1AAQ225 protein 


2 


omer 


100482 M65Q2B 


HSJ1361 


heterogeneous nuclear rttmicteoprotefri 


23 


other 


100518 NM.004415 


HsJ4316 . 


(tesnioplaft(DPlDPlQ 


1.9 


other 


100666 L05424 


Hs.169610 


CD44 antigen (homing functon and Indian 


5.7 


other 


100667 L05424 


Hs.169610 


CD44aitt^(hon^faK^aridlixfian 


9 


? 


1006S8 L05424 


Hs.169610 


CD44an^(hcfr^gftjnctoandtna^ 


7.7 


other 


100678 AW502935 


Hs.740 


PTK2protehtyrosHie kinase 2 


53.2 


other 


100783 AF078847 


Hs.191356 


general liansuipGon (actor l!H» polype 


6 


other 


100892 BE245294 


Hs.180789 


S164 protein 


1.7 


? 


100945 AF0Q2225 


Hs.180686 


uWquMn protein Dgase E3A (human papi 


1.5 


other 


100969 AA157634 


Hs.79172 


solute canter famBy 25 (mtochcfidrial 


6.3 


omer 


100988 AKD00405 


H&76480 


ubIquMke4 


11.4 


7 


100S99 H38765 


Hs£07Q6 


diaphorase (NADH/MADPH) ((ytochrome c-5 


1.6 


other 


101031 J05070 


Hs.151738 


matnxnie4aIbpiDteiiase9(geMi^ 


8.4 


omer 


101045 J05614 




gJbfluntanpniEeralingceflnuc^ 


5 


? 


101077 N99692 


HSJ5227 


Ertpek^ selected from AFF^sbTgtec^ 


2.6 


o&ier 


101093 106419 


Hs.75093 


cfoooftagen-tyslrie, 2-oxogIutarate5-dio 


1.4 


? 


101186 AA020956 


Hs.179881 


oof&btndlng factor, beta subunS 


2 


TM 


101216 AA284166 


H&B4113 


(*cfrHiependertldna^ 


1.8 


omer 


101228 AA333387 


H&B2916 


chaperonhconta^TCPI, $ubunR6A( 


1.7 


TM 


101247 AA132666 


Hs.78802 


glycogen synthase kinase 3 beta 


1.9 


other 


101249 L18964 


Hs.1904 


protein kinase C, iota 


US 


omer 


101332 J04088 


Hs.156346 


topcfcomerase (DMA) D alpha (170kD) 


5-3 


other 


101352 AW94299 


H&1G297 


CQX17 (yeast) hwnoiD&cytc^ 


42 


other 


101396 BE267931 


Hs.78996 


profifejaflng cell nuctear antigen 


15 


TM 


101445 M21259 




gbJtuman Aki repeats h me region 5 to 


1.6 


TM 


101470 NM.000546 


Hs.1846 


. turner protEtop63(Ufrajmertf 


25 


other 


101478 NNL0Q2890 


Hs.758 


RAS p21 prcteh acfiratar (GTPase actfva 


5.5 


other 
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101463 M24486 
101540 J04977 
101573 AW248421 
101580 NHJM2151 
101592 AF064853 
101621 BE391804 
101702 AW504089 
101734 M74099 
101759 MB0244 
101767 M81057 
101782 AA306495 

101805 AW409747 

101806 AA586894 
101810 NM.000318 
101879 AA176374 
101911 AA441787 
101920 AF182G45 
101973 U41514 
102009 BE245149 
102036 BE250127 
102083 T35901 
102107 BE256602 
102123 NNL001809 
102165 BE313280 
102198 AW950852 
102217 AA829978 
102220 U24389 
102234 AW1 63390 
102260 ALQ39104 
102302 AA306342 
102330 BE298063 
102339 BE378432 

102348 U37519 

102349 AU077055 
102369 U39840 
102374 U33635 
102391 AA296874 
102455 U48705 
102465 NM_001359 

102488 U50939 

102489 AL030116 
102494 AI188137 
102501 AF217197 
102522 BE25D944 
102532 AFO40253 
102564 U59423 
102568 W81489 
102580 U60806 
1Q2SJ1 AU077228 
102582 U61232 

102617 A W161453 

102618 AL037672 
102627 AL021918 
102663 NMJ302270 
102676 BE262989 
102687 NM.007019 
102689 U96132 
102698 BE54Q274 

102704 AU077058 

102705 T97490 
102750 A6014460 
102801 BE252241 
102812 U90549 
102827 BE244588 
102844 AV653790 
102868 X02419 
102925 BE440142 



Hs.76768 

H&84981 

Hs^50758 

H&83363 

Hs31299 

Hs.62661 

Hs.179574 

Hs.147049 

Ha.184601 

Ks.180884 

Hs.1869 

Hs.75612 

Kb.112409 

Hs.180612 

H&243886 

Hs.119689 

Hs3Q24 

Hs30120 

H&82643 

Hs32906 

Hs.75117 

Hs.182366 

Hs.1594 

Hs.159627 

Hs.74598 

H&301613 

H&65436 

Hs778554 

Hs.159557 

Hs59171 

Hs.77254 

H&95577 

Hs37539 

H&2891G7 

Hs£99867 

Hsl9Q572 

Hs.77494 

Hs75562 

Hs31548 

H&61B28 

Hs.74420 

Hs.75193 

Hs74562 

Hs.183656 

Hs.70186 

RsJ9C67 

Hs^23025 

Hs.152981 

Hs.77256 

Hs32675 

Hs.198767 

Hs31071 

Ks.158174 

Hs.168075 

Hs.12045 

Hs.93002 

Hs.171280 

H&239 

Hs54089 

Hsl500Q2 

Hs56196 

Hs38041 

HsJB6774 

Hs5456 

Hs324275 

Hs.77274 

Hs^943 



procoBagefFproftne, 2-oxog!utarate 4-di 
X-ray repair cornpternenflng defective rep 
pc^easo7T»(^ok)me f macfopaJn)26Ssu^ 



guanine nucfeoBtte bhcflng protein (G pr 

guanytate b&xflng protein 1, hterierork 

protefri phosphatase 2 (formerly 2A), Teg 

cut (DrosophDaHIte 1 (CCAAT dispbcem 

sciufce carrier farnBy 7 (cattonic amino 

cartxnypepfidase B1 (tissue) 

phosphogluoomutase 1 

stress^rKhjcedjitosph^ 

S100 ralchim-binifing protein A7{psorfas 

peroxisomal membrane protein 3 (35KD, Ze 

rariearaiJtosjiflgenfcspemipr 

glycoprotein hormones, afcha polypeptide 

D<cytokIna,aow-regutatorcrfHU\il 

UDP^I-acetykilpha^^ 

protefo tyrosine ktase 9 

CDC20 (eel cSviston cycle 20, S. cerevi 

tntateuxtn enhancer binding factor 2, 4 

heat shock protein 75 

centromere protein A (17kD) 

death associated protein 3 

polymerase (DNA dreded), deita 2, regu 

JTVIgene 



hetenxhrornatin-Qte protein 1 

karyophertn alpha 2 (RAG cohort 1, impor 

protein kinase C-fike 2 

chrornobax homotog 1 (Drosophfla HP1 beta 

cycSn-dependent kinase 4 

aldehyde de th ydru y a * ae 3 family, member 

bacutovffallAPrerjeat^xmtair*ig2 

hepatacyte nuclear factor 3, alpha 

PTK7 proteh tyrosine kinase 7 

deaxyguanosfoa kinase 

discokfin domain receptor tam2y, member 
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Homo sapiens mRNA fell length teertcDN 
H2A hlstone tamfly. member L 
CGWM protein 

ESTs. ModefateV simBar to T00358 hypot 
ESTs 

DKFZP586N0819 protein 
bacuJovtra) IAP repeatcontairang 6 
hypofte&cal proteh HJ10808 
Homo sapiens cDNA RJ13634 fis. done PL 
deleted in cancer 1; RNA heBcase HDB/D1 
Homo sapiens cDNA RJ11663 fis, done HE 
K1AA1025 protein 



nuctear factor VC (CCMWitodng tense 
hterferon-lnduced protein 75, 52kD 
ESTs 

KIAA12Z* protein 

Human done 23826 mRNA sequence 
chrDmosome 21 open reading frame 57 
putative heBcase RUVBL 
SRY (sex deterrrdrAig region Y)-bax 4 
phospfofcosfflde^Mnase, regulatory su 



B^CU7)ymphoma7A 
hypolheOcal proteh MGC10765 
hypomeocat proteh FU14566 
ESTs. WeaWy ssnter to 138022 hypofoefi 
ESTs 

YYI-assodated factor 2 
p10-bWhg protein 

gteyx46ftfts1 Soares mebnocyto 2NbHM Ho 
Hcmo sapiens cDNARJ1 1971 fis, done HE 
Homo sapiens cDNA FLU 0590 fls, done NT 
ESTs 

ATPase, tt+ transporting, lysosomal fvacu 
ESTs 

solute canter family 16 (morwarboxyuc 



ESTs 

CGI-12 protein 
zinc finger protein 281 
chrornosome 11 open feeding frame 24 
hypofheficat protein MGC5370 
hypometfcat proteh FU13912 
hypothetical proteh HJ20048 
KIAA1785 protein 
EST 
ESTs 

cytochrome c oxidase subunft Vic 
bronxxfomam and PW) finger contatnfog,3 
rapa-2 (rapa gene) 
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18509 N22617 Hs.43228 Homo sapiens CDNAFU1 1835 fe. done HE 

18528 AI949952 Hs.49397 ESTs 

18656 AW5802D Hs293287 ESTs 

18870 AA332845 Hs.152618 ESTs,MooWe*yskBarto2N91JiUMAN2 

ABQ33113 Hs501S7 WAA1287 protein 

8737 AA199688 gtazq75g09j1 StratagenehNT neuron (937 

18925 N92293 Hs206832 ESTs, Moderate^ skitfar to ALLWJiUMAN A 

18984 AI668709 Hs240722 ESTs, Moderately sffnBar to ALU8_HUMAN A 

18986 AF148713 Hs.125830 bbcklarcanoeiovenatpressedprota^ 

19206 W24781 Hs293798 WAA1710 protein 

19235 AW453C69 Hs3657 acfivMependsfitrtsuopro^^ 

19265 BE539706 Hs285363 ESTs 

19279 N57568 Hs.48028 EST 

9298 NM.001241 Hs.155478 cycflnT2 

9338 AI417240 Hs320836 ESTs,WeaWysimfartoA47582&^ePgr 

19403 AL117554 Hs.119908 nucleolar prote«NOP5/NOP5B 

19478 A1624342 Hs.170042 ESTs 

19486 AI796730 Hs55513 ESTs 

9513 W37933 EmpteDy selected from AFFXsingtepr 

9601 AKD00155 Hs31684 Homo sapiens n*^cDNAI><FZp6S7l103(fr 

9602 AW675298 Hs233694 hypotu^ protein RJ1 1350 
19676 AA243837 Hs57787 ESTs 

19682 W61019 Hs57811 ESTs 

19774 AB032977 Hs.6298 WAA1151 protein 

9780 NM_016625 Hs.191381 hypotheftal proteh 

9789 BE393948 Hs50915 kaffloefri5(KLK5;KLK4J2;stra&Jmcom 

19805 AJ223810 Hs.43213 ESTs, Weatfy stmflar to 1EFS_HUMAN TRANS 

9818 AA130970 Hs58382 hypothefical protein FU11 101 

9863 AA081218 Hs58608 Hero sapiens cDNAmi42D5fis. done NT 

9905 AW449064 Hs.1 19571 cofegen.type III, alpha 1(EhtoOart 

AA703129 H&58963 ESTs 

20132 W57554 H&125019 rymphoWiuriearproto 

120206 H26735 Ha91668 Hon»sap^dw»PP14^unknowmRNA 

20248 A1924294 Hs.173259 uratoactBrizBdbonamamwpnMeinB^ 

[20269 AW131940 Hs.104030 ESTs 

20274 AA177051 gbcrefl2aQ2ii1 NCLCGAP J>i3 Homo sapiens 

[20280 AA190577 gtezp52g02s1 Statagene HeLa cefl s3 93 

20296 AW995911 Hs299883 hypoftetol protein FU23399 

[20297 AA191384 Hs.104072 ESTs, Weakly similar to Z195_HUMAN ZINC 

20324 AA195517 Hs.191643 ESTs 

120325 AA195651 Hs.104106 ESTs 

20327 AKD00292 Hs27S732 hypofteiical protein RJ20285 

20336 N85785 Hs.181165 eukan^transtaSonekinp^fector 

120342 AW450669 Hs.45068 hypcAeflcaJprriehDKFZp434l143 

120345 AA210722 Hs.104158 ESTs 

20349 AW969481 Hs55189 hypofoetal proteh 

I2Q352 R06859 Hs.193172 ESTs. Weak* starter to O8022hypoftefl 

20356 AP0OO545 Hs296433 putaflwpunnergk; receptor 

20371 AA219305 Hs.104196 EST 

20382 AA228Q26 Hs36774 ESTs 

120383 AL109963 Hs.123122 FSHpn^n3sporee(U^1, raQhomoto 

I20386 AW969665 Hs.154848 rrypoftetfcal protefo DKFZp434D0127 

AA232874 Hs.104245 ESTs 

AW967985 Hs425572 ESTs, Moderately sbnfer to ALLtfJtUMAN A 

20396 AA134006 Hs79306 eukaiyofc transit »to 

120404 ABQZ3230 Hs£6427 WAA1013 protein 

20418 AW966893 Hs266i3 Homo sapiens mRNA; cDNA DKFZp586F1323 (I 

20423 AA236453 Hs.18978 Honw sapiens cDNA: FU22822fis,dor»K 

[20472 AB500B7 ata«j05c02jc1 NCLCGAPJOd12Horr»sapfen 

20473 AA251973 Hs269988 ESTs 

120484 AA253170 Hs.98473 EST 

120504 AAK6837 gfczrB4<rt0.s1 Soare3_NhHMPu_S1 Homosapi 

20509 BE047718 H&96545 ESTs 

20520 AA258601 Hs.161731 EST 

120535 BE35Q244 H&36547 ESTs 

20551 AA279160 Hs.111407 Homo sapiens, clone IMAGE3613029, mRNA, 

20570 AA280679 Hs.271445 ESTs, WeaWy simfer to ALU1J4UMAN ALU S 
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120582 BE244830 Ha284228 ZNF13Wflte proteto 

120990 AW372799 Hs.125790 teucfofrffch repeatantafoing 2 

120596 AA282074 Hs^37323 h^Bo^hioosamfri&iiho^ihateinutase 

120619 AW965339 Hs.111471 ESTs 

120624 AW407987 Hs.173518 Miihasaphosphapratelnhomotog 

120639 AA286942 gbczs56fl)5.s1 NCLCGAP_GCB1 Homo sapiens 

120648 AA287095 Hs.140309 Homo sapiens, done 1MAGB3677194, mRNA, 

120653 AW063659 Hs.191649 ESTs 

120668 AW969638 Hs.112318 6.2 Id protebi 

120669 BE536739 Hs.109909 ESTs 

120695 AA976503 pjbcoq30a04.s1 NC1_CGAP_GC4 Homo sapiens 

120696 AB21539 H&97249 ESTs 

120713 AVW49855 HSw96557 Homo sapiens cONA RJ12727 fis, done NT 

120718 AA29Z747 Hs.97296 ESTs 

120750 AI191410 H&96693 ESTs, ModeiateVslmaarto210926QABc 

120774 AI608909 Hs.193985 ESTs 

120807 AA346385 Hs30002 SH^xmta^protehSH3aB2;KtAA1848 

120809 AA346495 gb£ST52657 Fetal heart II Homo sapiens 

120938 AA386260 Hs.104632 EST 

120977 AA398155 Hs,97600 ESTs 

120984 BE262951 Hs.99052 ESTs 

120985 AI219896 Hs.97592 ESTs 
121011 AA338360 Hs.97608 EST 
121026 AI439713 Hs.165295 ESTs 

121081 AA398721 Hs.186749 ESTs, Highly simBar to 137550 mismatd) 

121133 AA363307 Hs.97032 ESTs 

121176 AL121523 Hs£7774 ESTs 

121223 AKM2110 HsJ7169 ESTs, Weakly sknflar to dJ667H122.1 (H. 

121320 AA4O3008 Hs301927 C6.1A 

121340 AW956981 Hs£7910 Homo sapiens cDNARJ13383fis, done PL 

121408 AA406137 Hs58019 EST 

121439 AA410190 HsS8076 ESTs, WeaMy.sirnfer to A47582 B-ceD gr 

121450 AA406430 Hs.105362 Homo sapiens, cbne MGC:18257, mRNA, com 

121452 AW971063 HsJ232862 ESTs 

121455 H58306 Hs.15165 retmofcadd!ri4iced14 

121457 W07404 Hs.144502 hypdh6ficatpnMiFU22055 

121496 AA442224 HsS7900 ESTs 

121505 AA494172 Hs.194417 ESTs 

121508 AA402515 H&97887 ESTs 

121513 AA416653 Hs.181510 ESTs 

121514 AA412112 gkzt69b02s1 SoaresJesfcJWT Homo sap 
121549 AM12477 HsS8142 EST 

121558 AA412497 gtezt95g12s1 SoaresJesfisJWT Homo sap 

121577 AA411970 H&98096 EST 

121581 AA416568 gbzuQ5c10.s1 Scares jDStfcJJHT Homo sap 

121589 AD0O1528 H&89718 spenrta synthase 

121594 AA626010 Hs.98247 ESTs 

121622 AA416931 Hs.126065 ESTs 

121655 AA421537 Hs.178072 Homo sapiens mRNA: cONA DKFZp434B1023 (f 

121682 AA418160 Hs36043 Homo sapiens cDNA FU13558 fis, done PL 

1216S0 AV660305 Hs.1 10286 ESTs 

121706 U55184 Hs.154145 hypomefcal protein RJ1 1585 

121714 AM19225 Hs^8269 Homo sapiens cDNAFU11953 fis, done HE 

121729 AI949597 Hs£8325 ESTs 

121731 AM21041 Hs.180744 ESTs 

121744 AA398784 Hs.97514 ESTs 

121748 BE536911 H&234545 typofoefcal protein NUF2R 

121773 AB033022 Hs.156654 WAA1196 protein 

121775 AA421773 Hs.161006 ESTs 

121776 AA292579 Hs.125133 hypomofcal protein FU22501 
121786 AI810774 Hs38376 ESTs 

121832 AW340797 Hs38434 ESTs 

121836 AA328348 Hs£1B289 ESTs 

121839 AM25691 Hs.191606 ESTs,H^sintotoKIM1048piote!n 

121842 AFQ27406 Hs.104865 serineWweonme kinase 23 

121847 AA446628 Hs-2799 cartage Bnkmg protein 1 

121871 AW972668 Hs.293044 ESTs 
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121882 
121911 
121915 
121935 
121983 
121985 
121995 
121999 
122009 
122013 
122036 
122050 
122060 
122114 
122188 
122204 
122246 
12225/ 
122302 
122341 
122356 
122369 
122371 
1 22372 
122378 
122405 
122412 
122415 
122418 
122440 
122446 
122448 
122458 
122460 
122464 
122490 
122492 
122502 
122510 
122530 
122547 
122555 
122570 
122572 
122586 
122587 
122598 
122599 
122602 
122607 
122614 
122616 
122617 
122618 
122622 
122717 
122762 
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122834 
122S36 
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AM26376 

AA427950 

AA428179 

AM28647 

AA298760 

AB62570 

AA210863 

AA43Q211 

AW292763 

AA431085 

W92142 

AJ453076 

AA431738 

AW161023 

AA398838 

AM35936 

AA329550 

AA436819 

AA441801 

AW601959 

AA443794 

AA443985 

AA868555 

AA446008 

AB032948 

AA446572 

AA446869 

AA446918 

AM46966 

AW505139 

AA447603 

AA447626 

AI266159 

AW418788 

AA448158 

AA448349 

AA448417 

AA204969 

AA449232 

AW959741 

AA779725 

AA194055 

AA452578 

AA452601 

AKD01910 

AB040&3 

AKB8173 

AL355841 

AA411925 

AA453518 

AA453630 

AA453638 

AI681535 

AA453641 

AA453987 

AA456859 

AB76875 

AW204530 

AA461492 

AA460581 

AA461509 

AA460584 

AA60Q235 

AI929374 

AA335721 



Hs.98458 

H3.223405 

Hs.98611 

Hs.180191 

Hjl299214 

H&3532 

Hs58668 

Hs.160822 

Hs.98706 

Hs271983 

Hs.166109 

HsS8750 

Hs.104921 

H&98842 

Hs29417 

Hsl98899 

Hs.104947 

H&99010 

H&98390 

Hs303222 

Hs.178222 

H&336677 

Hs21356 

Hs303223 

Hs.119316 

Hs59088 

HsS9090 

H&9460 

Hjl99123 

H&99127 

Ha.104980 

Hs39148 

HsS9152 

H&238151 

Hs.104990 

Hs234863 

HsS9195 

Hs.40368 

Hs.164589 

Hs293858 

H3262907 

Hs*9287 

Ha39303 



AF005216 



Hs3932) 

Hs39330 

Hs301960 

Hs38G23 

H&99339 

Hs.161873 

Hs.148135 

Hs.144802 

Hs.178358 

Hs.105119 

H&99500 

H&99545 

H&290996 

Hs293565 

H&334386 

Hs3625 

Hs.75367 

Hs.119394 

Hs283705 

Hs.115541 



ESTs 

gkz*50fD2s1 SoarBS„tota|Jelus_Nh2HF8^ 
ESTs, Moderately similar to A45010 X-fln 
EST 

hypotoeflca! protein FU14904 

Homo sapiens, done IMAGE2822295, mRNA, 

nemo-Wee kinase 

EST 

Homo sapiens cDNA: FU20863 fis. clone A 
ESTs 

ESTs, Weakly sinter to ALU5_HUMAN ALU S 
ELAV (embryonic lethal abnotmal vision, 
EST 
ESTs 

gbafflOdOlJI Soares.fiesfeJIHr Homo sap 
EST 

HCF^toding banscnpfion factor Zhangfe 

ESTs 

ESTs 

hypoftefcal proteh FU22263 strife to 

ESTs 

ESTs 

ESTs 

EST 

hypothetical protein DKFZp782K2015 

EST 

ESTs 

EST 

ESTs. Moderately similar to similar to K 

Homo sapiens mRNA; cONA DKFZp547C244 (fr 

EST 

EST 

ESTs 

ESTs, WeaWy simBarto S43569 R01H10.6 

EST 

EST 

ESTs 

Homo sapiens cDNA FU12082 5s, done HE 
ESTs 

adaptor-related proteh complex 1, sigma 

ESTs 

ESTs 

ESTs 

EST 

Homo sapiens cDNA FU11048 fis, done PL 

WAA1460 proteto 

ESTs 

hypoftefca) protein RJ235B8 

ESTs 

ESTs 

EST 

ESTs 

sertne/threonine kinase 33 

gbax48e06.s1 ScaiesJestfc.NHT Homo sap 

ESTs 

ESTs 

ESTs 

ESTs 

Homo sapiens cDNA HJ10658 fis, clone NT 
ESTs 

ESTs, Weakly similar to putative p150 (H 
ESTs 

NIMA (never in mitoss gene a)HBfatedk 



ESTs 
ESTs 

Janus kinase 2 (a protein tyrosine khas 
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65 



22870 AW576312 Hs318722 Homo sapiens cONA: FU21766 Bs, done C 

122872 AW031394 Hs371Q3 ESTs 

22879 AA769410 Hs.128654 ESTs 

22907 AA470074 Hs.169898 ESTs 

22916 AA47014O Hs^29170 EST 

22981 AA478951 Hs.105629 ESTs 

23013 AW968324 Hs.17384 ESTs 

123016 AW338067 Hs323231 Homo sapierscDNAFU11946fe, done VE 

23034 AL359571 Hs.44054 rtfceh (GSK3B foteracftg protein) 

23072 A1382600 Hs.104308 ESTs, WeaWy sim3ar to K1AA1395 protein 

23082 AA485360 Hs.105661 ESTs 

23088 AI343652 Hs.105667 ESTs 

123110 AA486256 Hs.193510 EST 

23114 BE304942 Hs^65B48 myomegalln 

23131 T52QZ7 Hs^71795 ESTs, Wi^ sinter to I38022hypotttefl 

23132 AI061582 Hs324179 Homo sapiens cDNA FU12371 fis, done MA 
23136 AW451999 Hs.194024 ESTs 

123149 AI734179 Hs.105676 ESTs 

123152 AW601773 Ha270259 ESTs 

23258 AA490929 Hs.105274 ESTs, WeaWy similar to RMS1_HUMAN REGUL 

23315 AA496369 gtezv37d10.s1 Scares ovary tumor NbHOT H 

23369 AA504757 Hs.105738 ESTs 

23394 AA731404 Hs.105510 ESTs 

I23433 AW450922 Hs.1 12478 ESTs 

I23466 AA599042 Hs.1 12503 EST 

I23470 AW303285 Hs303632 Human DNA sequencefcom done RM1-110H4 

23471 AB021644 Hs.197219 zinc firmer protein 14 {KOX 6) 

23475 BE439553 Hs35Q52B Homo sapiens, done (MAGE'4098694, mRNA, 

23482 N95059 Hs35098 ESTs 

23466 BE019072 Hs3348Q2 Hanosa^cO^RJ146^ ketone NT 

23508 AW360388 Hs.155546 WAA1Q80 protehc Gcfetessodated, gamm 

23815 AA609170 gteaf12a12j;1 SoaresJes8s_NHT Homo sap 

23619 AA602964 gb:no97cQ2s1 NCLCGAP_Pr2 Homo saptens 

23G58 AA609364 gfczu71d09.s1 Soares_tesfc_NHT Homo sap 

23874 AE69609 Hs.105187 Wnesto protein 9 gene 

23735 NMJJ13241 Hs35231 FH17FH2 (tomairHxmtalnhg prated 

123738 AA609891 Hs.1 12777 EST 

23753 AA6Q9955 Hs2349S1 Hunfingtin mteracflng protein E 
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24417 N34059 gteyv28hQ9.s1 Soares fetal Dvorspteen 
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m N71076 Hs.102800 ESTs, WeaWy similar to neuronal thread 
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24649 N92593 Hs313054 ESTs 

24661 R48170 Hs.78436 EphB1 
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124583 AA381661 Hs.1 19878 ESTs, WeaWy similar to M3K9 JIUMAN MITOG 

124712 R09166 Hs.191148 ESTs 

124735 R22952 Hs^68685 ESTs 

124761 AA374755 Hs33560 Homo sapiens rnRNA to KIAA1771 protein, 

5 124768 AW368528 Hs.100855 ESTs 

124775 R41772 Hs.100878 ESTs 

124777 R41933 Hs.140237 ESTs, Weakly sknflar to ALU1JHUMAN ALUS 

124788 R43543 Hs.100912 Homo saplerecDrttFU22726fis, done H 

124809 AL355722 Hs.106875 Homo sapfens EST from cto» 35214, fufl 

10 124811 R46068 Hs^88912 hypofteBcaJ proton RJ22604 

124812 R47948 HS.1BS732 ESTs 

124822 AA418160 H&86043 Homo sapiens cDNA FU13558 fe, done PL 

124825 AA501669 Hs336693 ESTs 

124833 AW975868 Hs^94100 ESTs 

15 124857 R63652 Hs.137190 ESTs 

124860 R65763 Hs.101477 EST 

124863 AB82555 Hs.127950 bromodomalrHantekimg 1 

124876 AF135422 HsJ7059 GDP-mannose pyrophosphorylaseA 

124878 BE397530 Hs^88057 hypomefcai protrfn RJ22242 

20 124902 H37941 Hs.101883 ESTs 

124903 AW296713 Hs321441 ESTs 

124930 A1076343 Hs.173939 ESTs, WeaWy simBar to ALUBjflJMAN Oil 

124942 R99978 H&268892 ESTs, Moo^ratdyslm8artoB34087hypol 

124958 M078645 Hs.431 murtieteukembvtral(r^1)onoogefieh 

25 124980 T40841 Hs*8681 ESTs 

12002 T59338 Hs369463 ESTs, WeaWy sfcnfer to ALU1_HUMAN ALU S 

125047 T79815 Hs779793 ESTs 

125051 T79956 H&100588 EST 

125056 TB1310 Hs.100592 ESTs 

30 125101 AM72068 H&286236 WAA1856proteh 

125113 T96595 Hs^02Z70 ESTs, WeaWy similar to ALUF_HUMAN [Hi 

125115 T97341 gkye57e05^1 Scares fetal Over spleen 

125125 AI222382 Hs340767 Human DNA sequence from clone RP1-12G14 

125147 W38150 EmpfrfcaHy selected from AFFX single pr 

35 125161 W44657 Hs.144232 EST 

125249 AA630863 Hs.131375 ESTs, Moderately sim8ar to ALUBjflJMAN I 

125255 AF0981S2 Hs.118631 6nntBSS(D(DSophaa)homi)log 

125279 AW4018Q9 Hs.4779 WAA1150 protein 

125280 AI123705 Hs.106932 ESTs 

40 125298 AW972542 Ha289Q08 Homo sapiens cDNA:aJ21814 fe, clone H 

125660 AW292171 H&23978 scaffold adachnent factor B 

1258Z7 NMJ0O34O3 HsJ7496 YY1 transcription factor 

125891 U29589 Hs.7138 chofciergfcrec8pkr,rnuscannlc3 

126005 AW409701 Hs.1578 bacukwlral IAP repeaMartaming 5 (sur 

45 126202 AA157632 H&272630 vaoiolaTpr^punpdetepdypepade 

126695 AA643322 Hs.172028 adlsirtBgiinarrfmetaflopr^^ 

127050 AW411066 H&274351 CGU39 protein 

127274 AW966158 Hs£8582 Homo sapfens cDNARJ12789fe, clone NT 

128355 AW293012 Hs.161623 ESTs 

50 128493 D87466 Ha240112 WAAOZ76 protem 

128522 BE173977 Hs.10098 pu^nucteobfRNAheficase 

1285Z7 AA504583 Hs.101047 transoipttontactor3(E2AlmnTimog^ 

128528 R39234 H&251699 ESTs, WeaWy sfrnDar to IDN4-GGTR14 Jta 

128595 U31875 Hs£72499 shot<iiainalcohddenydrogen^ 

55 128589 NMJH5366 Hs.102336 Rho GTPase acftaflng protein 8 

128604 AJ879099 rte.102397 GIOT-3 tac*wa<toropm hAicibtetransc 

128608 BE267994 Hs.102419 zinc finger protein 

128625 ABD37841 Hs.102652 hypofcetoprotehASHI 

128629 AUQ96748 Hs.102708 DKFZP434A043 protefl) 

60 128639 AW582962 Hs.102897 CGW7 protein 

128656 AA458542 Hs.10326 coatomerpctdna)rT^subunleps2o 

J28658 BE397354 HsJ2483D dtpfteratorti reslstence protein requi 

128670 AA975486 Hs.103441 Homo sapiens, Similar to RlKENcDNA 1700 

128691 W27939 Hs.103834 hyrwmetx^ protein MGC5576 

65 128696 BE081143 H&225977 nudearieceptorco3c6vator3 

128700 Y15221 Hs.103982 srr^inA)cibte<*toWra 

128714 T85231 Hs.179661 tubulin, bete 5 
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TABLE 7A 

Table 7 A shows the accession numbers for those pkeys lacking unigeneETs for Table 7. 
5 For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Pkey. Unique Eos probeset Marrtffler number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 

15 

Ptey CAT number Accession 

102481 31281.-28 U50360 
20 105032 genbanKuAA127818 AA127818 

409487 1134778J H19886 AW402806T1Q231 
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TABLE 8: Figure 8 from BRCA 001-1 US 

Table 8 shows genes upregulated in tumor tissue compared to normal breast tissue. 
Specifically, one column shows the ratio of expression of the indicated gene in breast tumor 
tissue compared to other body tissues, and another column shows the ratio of expression of 
the indicated gene in breast tumor tissue compared to normal breast tissue. 
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103658 NMJD00088 Hs.172928 collagen, type I, alpha 1 3.2 3 
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114781 T10446 Hs35388 ESTs 1 43 

114795 AB037858 Hs.173484 hypolhefcal proteh FU10337 13 9.2 

114901 AV660012 Hs.196437 hypofoefcal protein FU1Q788 1.4 52 
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131961 AA129782 H&3576 Hwno sapiens mRNA fall length Insert cDN 
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131997 AF229181 Hs.136644 CS box-contoWng WD protein 
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132208 AL031709 Hs241575 hypoftefcaJ protein CAB56184 
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132316 U28831 Hs.44566 WAA1641 protein 
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132718 NM.0O4600 Hs554 Sjogren syndrome antigen A2 (60kD. rfoon 
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132998 Y00062 Hs.170121 protein tyrosine phosp ha tase, receptor! 
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133011 NM.006379 Hs.171921 sema*>maHtamw0DD^ 
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zinc finger protein 238 1.5 43 

hypodtetiral protein MGC10753 1.4 63 

apot^poptotenA-fl 02 35 

H factor 1 (complement) 05 5 

add duster protein 33 \2 4.2 

DKFZP564l1922proteh 3.7 53 

KIAA0447 gene product 1.4 5.1 

hexoUhase2 0.9 63 

K1AA1235 protein \2 3.7 

^s^^n^cDrUDKFZp434G227(fr 0.7 43 

invnunogbbuQn tambda-fike polypeptide 1 1.1 43 

voftagejependent anion channel 3 0.7 M 

hypofit6&calpnMiMGC2714 3.1 53 

desmopfelmfin DPII) 43 115 
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H97991 Hs.193313 EmpMcaSy selected fiomAFFX single pr 1.4 33 
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AI301740 Hs.173381 (firrydjopynmk&ose^e 2 03 133 
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L77964 H&271980 mftogervacftrated protein kinase 6 1.1 63 

zinc firmer protein 146 13 33 

tyrosine 3-mc#xxoygeftaseftr^ 13 11.1 

heterogeneous nuclear ribonucteoprotein 2 3.9 

syrtapfcoohysin 0.6 33 

WAA0174 gene product 12 72 

CREBBP£P30Oteh3)ac^proteBi1 13 5 

zyrfn 1-2 43 

cadherin 11, type 2, OB-cadherin (osteob 3.2 4.1 

BE268649 Hs.177766 AlP-^ftjosyftransferase (NAEK pory (ADP- 2.1 33 

W24087 Hs.76285 DKFZP564B167 protein 13 123 

AFD75337 Hs.76233 foyrnosh, beta 10 2.6 63 

Gprotebcoupfed receptor kinase 6 1 43 

step I) spiring factor SLU7 03 33 

H1h)stenefar^.ri)efnber2 13 43 

Homo sapiens rnRNA; cONA DKFZp564B1264 (f p 5.6 

cartwxytesterase 1 (mctiocyte/macfophage 03 MA 

ESTs 55 23 

AJ815523 Hs,76830 syraJdeh,a^te(r»iMcorr^x)r^dain 0.6 43 

X07767 Hs.77271 protehktr^cAAffKiepenQ^cata^ 1 102 

argtnytirans^ase 1 03 45 

cabjmenh 23 105 

AM58213 Hs.77542 ESTs 13 5.6 

AL031177 Hs.7756 proteasome (preset, rracro^ 15 63 

L41066 Hs.77810 nuclear tector otadratedT-cefc,cyt 15 33 

M54968 Hs.184050 v4(kas2KIrsten rat sarcoma 2 viral on 03 43 

L15409 Hs.174007 von H^jpeRindau syridrorne 23 45 

AL04032B Hs.78202 SWI^ related, rr^assocteted.acti 33 3.4 

R48316 Hs.7822 Homo sapiens rnRNA; cONA OKFZp564C1216 (f 13 57 

BE150882 Hs.143601 bypofteecdpotetohCLA-to 1 65 

ubio^specfo protease 7 (herpes vl 1.7 33 

lysosomal 1 75 

matrin 3 1.2 4 

cycflnG2 2.7 45 

ntitochortdnal ribosoma! protein L3 33 2.1 

Z43039 Ha.170198 KIAA0009 gene product 13 35 

AF035606 Ks.80019 r^rarnrnedcefldeafti6 1.7 6.9 

U77735 Hs30205 ptrn-2 oncogene 03 53 

X68194 Hs.80919 syriaptophysMte protein 1.4 11.4 
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AW503116 Hs301819 

AB52558 Hs.75544 
W17187xomp Hs232400 

AI018666 . Hs.75667 

AW001130 Hs.75824 

A1929S87 Hs.75847 

BE410769 Hs.75873 

M62194 Hs.75929 



AW239400 Hs.76297 

D25969 Hs.76325 

AW578716 Hs.7644 

AW630088 Hs.76550 

AA345824 HsJ6688 

AA147026 Hs.76704 



AW859528 Hs301497 
AU076964 Hs.7753 



NMJB3470 Hs.78683 

AIQ27881 Ks.7869 

AF117236 Hs.78825 

NM.004354 Hs.79069 

BE513171 Hs.79086 
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134277 NKL004369 


Hs509&D collagen, type VI, alpha 3 


Z6 


35 




134280 NM_00O712 


Hs51Q29 bflwerdln reductase A 


18 


55 




134288 AI022650 


Hs5117 erbb2-toieiac&g protein ERBIN 


1.1 


3.6 




134296 R00603 


Hs512B p^wsphrtffdylsBnnft dscfliboxyfo^ 


1.1 


55 


5 


134300 KM 0D1430 


Hs5136 erKtotheBa} PAS domain protein 1 


0.5 


45 




134310 A1037800 


Hs.P1 48 setenoprotein T 


1.7 


75 




134343 D506S3 


Hs52028 bsnsfofmny yiowffi lectori bsta PSOGpto 


0.8 


75 




134364 X76534 

U"KJU1 A/KMT 


H*s HTTvfi fitvmmrifffln fliiiiiKiiiuiiittfsnfii nmb 


22 


3.6 




1343/4 N22687 


Hs_8236 PSTs 


1.9 


35 


10 

J. \J 


13437A Ain35786 


Wcft?4?5 %fin rested nmtski 2/3 onrnnlex. sufaun 


15 


85 




lOtOOX Dlj Immi 


1 I^K Iww 1 W Vv UUIIKUI vRHwPJ HUmHIUV PVwTIHHI |fl 


1.1 


3.6 




IWI3 nlfJUfUfc 


riAUCwl 1 |J1 i/CDH 1 LJIUyMW |W(WayiWllflnCT t/f*** 1 III 


15 


4.6 




iwi 1 1 uuwh to 


kt&ft2921 snbite cuT¥>r tami)v 35 ICftfP-siaSc ad 

rW^WfeJfc 1 aMUIP wQIIIC* KUIUIj ^Wlni ^KUIw BM 


M 


75 




144491 A) Ifl77lQfi 




6.6 


8.7 


IS 




Uq 4 QftpiSn Phn fTTPaoo nrAfnfinrt nm^piri 1 
niLiJOOW rum \3ir oskj ownauiiji iaucii i 


2 


35 




ilAAHA Uli fV4'»yift 


LJo Ofifii94 f^Tt94 fmfinpn fvnftR rpR btnn rAfrfnrvra 


35 


1.1 




4Qjjtfn niicQfti 
IJVkH UotwOl 


l|» |UfUl4 amA'uA hx^tet iwoircftrnmlpgi ffvtfinhg 

rKMrHNn CUlljUiU Utw |HCUII«tUI |JIUIOIII ^bJHfpud 


1.5 


4.4 




4<iAc/v4 WftififiQ 

1349U1 wowoy 


fTT r 1 BUKnijOUP UOIIMoUWI IPUQUmi loHW 


12 


5.7 




]J*KjU3 MmUDUD/O 


Ma 477 I i3fl ATP awnttmoA I44> traiwfwriirw nwtnrhwirt 
ikL 1 / / iUU Mir ajiiuicBiCt NT uoiia^wuiy, iimwmiu 


1.3 


35 


9ft 


iiiRon DcntHfVK 

1 DCUiflWO 


rlSL/ tOO 1 obUVafcUU Win |Mjlimaao U UallaU IpUU 


13 


45 




134910 MzdlDl 


Uq RATT^i Human h m >QfWVC/wJBf o atamont mDUA 


OA 


5.6 






nSw00159 OUtJialB Ic^OTISEe wtM I \EurHcSpui5B 


1 4 






134993 AUU3949 






35 


OS 


4 1AU.n UU A4C4AO 

134573 NM_Ulbi4z 


MSuu»>i/ scBiuia oenyufugenasB nofnotug 


1 % 




134570 AoUJoulr 




v.? 


3.7 




404CT7 ocoAAVft 
1349/ / DtZ'WJZJ 




4 


65 




1349/ff nVVSMOSKO 




22 


45 






nSwoOU'l 1 xAsfa uipHi repeal mnonj pmuan i 


15 
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1340UU MTVf 00351 


HsJ6347 bypolheflcal protein 


21 


35 


134099 ArZoagUo 


Hs.123090 SWl/Sr^r^a1ed,rnatrt(assodated,a^ 


1.7 
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4U7FV1 AlflWlCnC 

134/W AaUUUDUD 


Hs^868 go^l SNAP receptor complex rremberl 


4.4 
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13473/ D17530 


Hs^9434 drebrinl 


ai 


1 fi 
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14J7R9 TOQJK 
134/OZ 1 51900 


Ks^83108 temogbUn,ganvnaG 


05 


4.6 


"is 


14AAA4 AAAOQgQft 
13W» /W><c09ZU 


Ks^OOGI progestefone bhtfing protein 


15 


3J 


134oo4 JU3454 


Hs.179573 colagen. type I, alpha 2 


8.7 


17 X 




134005 AA50r//5 


Hs.66295 rnuffi-FlH-domai^contair^ 


1.7 
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134000 ADUAJOO? 


Hs.90419 WAA0882protefo 


3.4 


05 




404074 AlftfOTftl 
IMOIQ MUHM/OI 


Hs^90458 serine poirnftDyftransfaai^, long chain 


15 


65 




134009 /WIUOMU 


Hs.9071 progesterone nsmbrane binding protein 


1.4 
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Hs.90787 ESTs 
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134sU0 Dcuoy/oz 


HsW7 hypoSietical protein 
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135011 AB037fl35 


H&92991 NAA1414proteh 
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1MB? AW301984 


Hs.173685 hypodie&al protBn RJ12619 
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4«nv; At Q34344 


H&284186 forkheadboxCI 
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13SDS1 AI272141 


Hs33484 SRY{seao^eTm)nk)gregkxiY)4)Ox4 


42 
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135060 AKD018fl7 


Hs^59842 protefe)ldr^e t AKP-^vatedgarnrna2n 


15 


45 


135062 AKD00967 


Hs.93872 KIAA1682proteh 
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135077 AW503733 


HsS414 (OAA1488protBki 


28 


37 




135082 AB017363 


Hs^4234 mzzted(Drosophfia)hoinotog1 


24 


45 




135W T97257 


Hs337531 ESTs,MaJefate^sfrnBaitot36^hypot 
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135143 AA132813 


Hs.69559 WAA1098 protein 
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135156 BE563088 


Hs.9552 binder dArlTwo 
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135172 AB028956 


Hs.12144 KIAA1033pioteln 
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135181 BE2508G5 


H&278529 pc19^tce protein 
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75 




135222 AA534009 


Hs.183487 MerfBfDns&rm^a^(20fcD) 


15 


35 
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135289 AW372569 
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135290 AA331901 


Hs.184736 rrypcjme8calprotehaJ10097 
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35 




135291 T83882 


HsJ7927 ESTs 
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135349 AA114212 


Hs^930 serine [or cysteine) protefoase inhfbto 


26 


85 


65 


135357 AI565004 


Hs.79572 cathepslnD(iysosc^aspartyl protease 
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135398 M16029 


Hs,287270 ictproto^mcog«)e (mutate endocrine n 


0.4 


75 




135399 W79431 


Hs326249 ribosomai protein L22 
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45 




135400 X78592 


Hs.99915 arrirog^ receptor (diiydrotestoste 
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302892 
3029S3 
3(0131 
5 303150 
310125 
312662 
319429 
320591 

10 406779 
410891 
410763 
415738 
420186 

IS 4220S5 
425815 
426218 
427397 
427466 

20 427505 
427723 
428673 
430219 
430450 

25 432866 
433423 
437562 
437667 
437754 

30 440252 
441471 
448292 
449404 
449964 

35 451389 
452685 
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R99693 Hs224410 Homo sapiens cDNA RJ12843 fts, done NT 3.6 
AW176909 Hs.42346 
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55 



60 



65 



AW673106 Hs.151945 
AW081061 Hs.103180 



AA887146 
AA147979 
AA233808 



Hs3217 

Hs2850Q5 

Hs286241 



BE616412 Hs286218 

AA0547B1 Hs.169149 

AA412048 Hs279574 

AW239226 Hs35450 

AF279145 Hs3966 

BE539367 HS295953 

NM015925 Hs35697 

NhU>14320 Ks.111029 

R94Q23 Hs337531 

AF113043 Hs.168005 

A1929685 Hs.177656 

AA523543 Hs.7678 

AA361562 Hs.178761 

AB5526Q Hs279789 

AW501325 K&324278 
Hs235887 



R23553 Hs241489 

BE395875 Hs2796Q9 

8E407127 Hs3997 

AB001636 Hs3683 

BB616412 KSJ286218 

R60366 Hs3822 



caWneurti-Wnditg proteki cabaidn-1 33 

mitochondrial ribosomal protein L43 OS 

DC2 protein 3 

stromal antigen 2 62 

niftochQndnal Import receptor Tont22 12 

protein kinase, cAMP-dependent regulato 1 

juncfonalatfiesicfl molecule 1 13 

kaiyophen^a^1(trnpoftkie^5) 12 

(X^pio^ceBdee8HegutatDjypr 13 

refcuton4 12 

hypothetical protenFLEI 778 2 

ESTs, Weakly stmSar to AF220049 1 uncha 13 

Iver-specfebHUi-Z^transciip^ta 13 

putative henie-bindlng protein 2 

ESTs, MooaatetysbnSar to B8Q22 hypot 1.7 

Homo sapiens cDNAFU13372fla, clone PL 33 

catmohjfo 1 (phosphoiyla^ 13 

ceflularre&^ado^bbalngprofefril 1.1 

26S protBasome-assodated pad1 homotog 32 

htetorw deaostylssft 3 2.8 
Homo sapiens mRNA; cDMA DKFZp566MQ63(!ir 

HMT1 (hnRNP meflryitransferase, S. cerevi 13 

hypofheScal protein 1.1 

rrutochondrfal canter homotog 2 13 

heat shock 70kD protein 1A 13 



BE513940 
AL042986 



HS3101 

Hs.7857 



BE281316 Hs.47334 

H51066 Hs23581 

AWD01741 Hs273193 

N73222 Hs279009 



A1634651 
RCLH15847JJ 
RC.W84712 
X14008_ma1J 
RC_H86543_f 
H07011 

RCJW645B6J5 

RCLAA070485 

RC_H98714j5 

ROMW6145J 

AA458584 

AAQ31548 

X02761 

RCLAM87193 

R25326 

RCLAA393805 

RCLAA449333 

RCJW287681_s 

RCJM490864 

RC.C14243J 

R21443 

RCLAA251902 

M2112U 

C00038.S 

Y00503 

RCJQ7006J 

RCJVA416886 

RCJ\A460450 

RCLAA488433 

RCJWZ78400J 

U28831 

RCLAA199588 
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WTiaf muscutoaponeurofic fibrosarcoma (a 
pepttdylprolyl Isomeiase B (cydophlBn B) 
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Homo sapiens mRNA for putative vacuolar 03 
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ESTs 23 

ESTs 
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ESTs 13 
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ESTs 23 
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ESTs 13 
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1.6 
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33 
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33 

62 
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62 
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AF0O6O82 


Homo sapiens acfirwetated protein Arp2 (ARP2) 


1.6 


10.9 


RCJBQ899 


desmopla)dn(DP1;DPI0 5.4 


5.5 s 




RCJW95070 


desmopbkfnfDPI; DPII) 5 


2.6 




RCJ9Q946J 


Human mRNA for WAA263 gene; complete ods 


1.1 


33 
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• protease; serine; 11 {IGF binding) 2.4 


35 




AA313414.S 


ESTs; WeaHy similar to cONA EST EMBLT1157 


1.5 


53 




ESTs; WeaMy similar to strrflar to Yeast 1.3 
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23 
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AFFX«UMRGBM1009B_5 1.1 
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7.4 
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5.4 
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ESTs 
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TABLE 8 A 

Table 8A shows the accession numbers for those pkeys lacking unigenelD's for Table 8. For 
each probeset, we have listed the gene cluster number from which the oligonucleotides were 
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" column. 

Ptey: Unfcjue Bos probeset Identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT number Accessions 



106489 116761 1 AA079487 AA123547 AA128291 AA079587 AA079600 
20 125076 130299 1 AA973971 T88817 AA253263 

114636 109698.1 AA075488 M129081 AA074851 AA082852 AA074732 AA084908 AA084751 AA076042 AA131172 AA085374 AA079519 
AA074510 AA1 13824 AA1Q2437 AA070833 AA070143 AA084693 AA084389 AA076373 AA075492 AA062B34 AA084335 
AA078829 AA079344 AA069916 AA079275 AA070914 AA654069AA081976AA0^0957AA083115AA070942AA085296 
123526 geriban)LAA6086o7 AA608657 
25 123533 genbanJLAA6C8751 AA6O0751 
125090 £mbankJ91518 T91518 
125154 genbanK.W38419 W38419 
118475 genban»LN66845 N66845 
118505 Qenban»LH67343 N67343 
30 101046 GfrtiB2j<D116OK0116O 

129982 221J67 Z1 4221 AW38 1862 M97920AW401444Z66542M^470AW406502^ 

AW403806 Z14200 AA383972 Z14205 Z14201 M18513 Z14202 AW403684 X14584 AF062221 U43760 X65892 X65883XB2107 
Z80847 X65885 X65893 AF062142 XB5891 X17675 Z47Z74 Z47Z77 Z47276 X65888 Z47275 XB2109 AF062140 101278 
AF062134 AF062139 X81723 Z80840 XB1733 X81743 X81744 X81732 Z80843 AW402942 AW403516 X65919 AF062190 
35 AF062177 AF062222 AF0621 15 Z47240 AFC62263 AF062261 AF062223 AF06221 1 Z47238 AW401714 AW404008 AW404991 

AF062280 M99595 Z47214 Z47232 Z47218 M26995 AFD62184 XB5895 L38433 X81731 Z1 1946 Z47226 AF062205 AF174012 
L01276 AFD62168 AF062136 XB1755 X81748 AF174019 AF062285 L06924 L11699 AW402665 L09070 L28049 L08090 
AW407843 Z14171 AW402944 108083 AW405627 L33035 L26907 M17750 L28052 M17751 AJ239360 U19885 L14821 X56526 
AB014341 U2D87 L12098 U68231 L12184 AF062242 AF022000 US4499 U00570 AB68604Y15773 X64239 XS2969 U00506 
40 X73605 X99360 U00o77D83677AB021539AHB5ra 

D83676 ABQ21529 Z18318 U00488 L01412 X81746 U21262 U21272 U00560 AF174060 U00547 U00561 Z18321 S73957 
M26435 AF115130 AF115117 Z92896 U21254 Z92895 AF1151 12 AHB2290 T28938 Y09386 AF174067 UZ7189 Y09384 
U77373 AF174Q57 M17749 X69692 AF174038 U64478 U64486 AF174098 X99365 AF174077 AF174083 AF174089 AF015133 
AF127792 AJ006171 U0Q51O U58144 AF004323 AF115109 AF004324 AP015130 AF090414 AFQ90418 AB021536 AB021519 
45 ZS6957 AF021986 Z32898 296956 L34164 AF062251 AF052524 U0Q549 UQ0541 D83887AF052522AF087420 AFQ52520 

AFO2201 1 L43085 U00536 L19915 L43083 AP09042D AB021530 Affil21534Z96955Z98954 Z339O3X62964AF1O32»UOO509 
AFl)62298AR)62289AF0623(J7AW408326L33037U)433^ 

AF1MQ92 AI906954 AF1031B4 104343 AW364860 AF001424 AF103163 Z98717 AF103143 AJ008250 L04323 AF103321 
AF103335 104342 L03818 L03817 AW404978 AW403690 U86801 AF103150 AJ010334 AF0350Z7 AJ0073Z7 AF1031 15 
50 AF017458 /U008207 AJQ08183 AJ008196 AJ0Q8241 AJ008208 AF103210 AF068668 AF068670 AF068871 AF066664 

AF068669 AF068666 AF068665 AFD68672 AW371244 AW403670 AW408074 AW404575 AW3621 53 AW403803 AW406702 
AW351514 D78345 T29140 J00231 NKLOQ2179AW405146AA301091 X04646 H64660 AW402990AW406534 T93007 
A1857980 AW368899 ABG5833 AW406586 AA482084 AI872299 AA715266 AW404328 AI831674 AI709348 AA6031 12 
AW514864 AM85775 H64492 AVW04789 AA487630 M715498 AA295885TZ7613T98113 
55 108470 o^rtoMA079500 AA0795OO 

101447 entn*_M21305 M21305 

124447 goibanKJW8000 N48000 

101624 entreU*55998 M55998 

131791 221 J60 X621 11 S67984 AJ131056 Z47243 Z47235 AF062268 Z47237 AJ131058 AF062130 AF062282 X62108 AA385989 AA464794 
60 X69861 AW4O2964hfe0808Z98735Z98734Z98736AFO35O18X79161 U0Q545 AF174046 AF174071 U00552 U96288 

AFQ21989AF062255AF174061 S73953 AF062135 AF0621 55 X64147 U00555 AF174070 AF129754 U86769 Z38714 Z98738 
S75886 AJ008175 AJ008166 AJ0081B8 YD8403 H15753 H22208 AA327241 AW405737 H42300 K39677 H26239 H26903 
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H45128 R86072AA327565 AI660584 AW361537 AW383759 S71043H15014 H45570 H42819 H45523 H45134 R72043 
H24543 H27636 H27610 T28147 H25496 AW364071 AW364072 H45561 H45556 H42605 AI735017 T47421 R46719 H27570 
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TABLE 9: Figure 9 from BRCA 001-2 US 

5 Table 9 depicts a preferred group of genes upregulated in tumor tissue compared to normal 
breast tissue. 
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TABLE 10: Figure 10 from BRCA 001-3 PCT 

5 Table 10 depicts a preferred group of genes upregulated in tumor tissue compared to normal 
breast tissue. 
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33765 M62194 Hs.75929 
33780 AA557660 Hs.76152 
33784 BE622743 Hs301064 
33814 NM.002462HS.76391 
33829 AW6300B8 Hs.76550 
33845 AA147026 Hs.76704 
33913 AU078964 Hs.7753 
33968 AA355986 H&232068 
133990 R48316 Hs.7822 
I33999 AA535244. Hs.78305 
34032 NM_005Q25Hs.78589 
34064 AF091622 Hs.78893 
34087 U51166 Hs.173824 
34089 R51273 H&79029 
34095 NM.004354HS.79069 
34098 BE513171 Hs.79086 
134110 U41060 ■ Hs.79136 
34125 NML014781Hs^0421 
134246 D2B459 Hs30612 
34257 C05768 Hs3078 
34272 X76040 H&278614 
34282 R45621 Hs31057 
34288 AIQ22650 Hs3117 
34321 BE538082 Hs3172 
34326 AW903838 Hs.81800 
34328 AW959281 Hs3184 
34348 AW291946 Hs.82065 
34359 NMJD01982K3.199067 
34367 AA339449 H&82285 
34374 K22687 Hs3238 
34380 AU077143 Hs.179565 
34395 AA456539 Hs3262 
134401 A1916662 H&211577 
34405 AW067903 Hs.82772 
134415 AT750762 Hs32911 
34417 NHJXJ6416Hs32921 
[34419 W95642 H&62961 
34421 AU077196 Hs32985 
34436 U29344 H&83190 
134485 XB2153 Hs33942 
34487 AFD61739 Hs33954 
34495 D63477 Hs.84087 
34520 BE091005 Hs.74861 
34542 M14156 H&85112 
34570 U66615 Hs.172280 
I34590 AW903849 Hs.173840 
34604 MyL0Q2884Hs^65 
34612 AW068223 Hs.171581 
34643 AW299723 Hs37223 
34654 AKD01741 H&8739 
34656 AI750878 H&874Q9 
34672 AF271212 Hs322901 
34700 AKD00606 Hs3888 
34711 XD4011 Hs38974 
34722 AF129536 Hs384226 
34856 BE281128 H&9030 
34880 AI879195 H&90606 
34917 XB7241 Hs.166994 
34921 AL137491 Hs.125511 
34982 AKD02085 Hs32308 
I34989 AW968058 Hs32381 
35029 K58818 Hs.187579 
135035 AL034344 ^284186 
135051 AI272141 Hs33484 
35062 AK000967 HS33872 
35073 W55956 H&94030 
35098 AW274526 Hs277721 



cadh&in11 > type2 f 0^cadherln{osteob 3.2 
deoorin 5.4 
arfeptkil 4.7 
rnyxovlrus (Influenza) resistance 1, homo 33 
Homo sapiens mRNA; cDNA DKFZp564B1264 (f 6.7 
ESTs 62 
caSumenin 33 
franscnptbn Iactor8 (represses infcert 3.7 
Homo sapiens mRNA; cDNA DKFZp564C1216 {f 3.4 



RAB2, member RAS oncogene ferity 
serine (or cysteine) proteinase bihibfto 
K1AA0244 protein 
mymhe-DNA grycosybse 
ESTs 
cycftnG2 

iiAochoiidrlal ribosomal protein L3 
UV-1 protein, estrogen regulated 
K1AA0203 gene product 
ubk?ui&H»rquga^en2ymeE2A(RAM 
Hornosap^dofteFBD3Cn^Ju-diatcrft 
protease, senn8« 15 
hypofrefcal protein MGC2718 
ertfl>2-fiieracfing protein EHB1N 
ESTs, Moderately shriQarto A46010 X-fin 
chondroffin sulfate proteogrycan 2 (vera 
ESTs 

fcteiteuMn 6 signal transducer (gp130, 
*enVb2 avian erymrcHasflc leukemia v 
phosphcntosyferycmamkle formytoansfer 
ESTs 

mlr&hfomosorna maintenance deficient (S. 



tfnecfa 1 {Wnesm receptor) 
coDagen, type XI, alpha 1 
protein tyrosine phosphatase type IVA,m 
soJutB carrier fiamfly 35 (CMP-sialfc act 
trefofi factor 3 {MestinaO 
coflagen, type V, alpha 2 
fatty ackt synthase 
caffiep6inK(pycncxJysostoso) 
protein associated wfft PRK1 
WAA0143 protein 

activated RKA polymerase II transcrfptk) 
hsulMke grown factor 1 (somatomedl 
SWt/SW rebteo; matte associated, acfl 
HUa (C>toff1Hnteiacbng protein 
RAP1A, member of RAS oncogene famBy 
ubiquifin C-feaTiflJiiaJ hydrolase UCH37 
bone morphogenette protein receptor, typ 
riypdheticatproteh FU10879 
ftroenbosponctin 1 
dfenflrterof siteiuiikfl 10 

SNAP receptor complex rri^^ 
cytochrome r>245, beta polypeptide (chro 
F-txa only protein 6 
TONDU 

15 kDa selenoprotein 
FAT turror suppressor (Drosopr^^ 
HomosapieremRNA;cDNADKFZp434P1530(f4 
Kom sapiens d)NAr^1223n^ done PI 5.1 
nudfc(nudeosHecaphosp^ 8.2 
hydoxysterold (17-teta) dehydrogenase 7 113 
forkheadbootCI 5.4 
SHY (sea debmiW^ region Y>tat 4 3.3 
K1AA1682 protein 33 
Homo saptens mRNA; cDNA DKFZp586E1624 (f 8.1 
ovarian carcinoma anflgen CA125 3.3 
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TABLE 10A 



Table 10 A shows the accession numbers for those pkeys lacking unigenelD's for Table 10. 
5 For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed* Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

10 

Ptey. Unk^ Eos probes^ ktentffier number 

CATnumber. Gene duster number 
Accession: Genbank accession numbeis 

15 

Pkey CAT number Accession 

123619 371681J AA6Q2964AA609200 

20 104602 52448^2 H47610R8692D 

121581 283769 1 AA416568 AA442889 AA4 17233 AA442223 

123523 p^rtbanKjAA608588 AA60858S 

100821 tigr_HT4306 M2S460U09116 

12091 g8nbanKJ91518 T91518 

25 125150 ^J r OUND.errfrB2LW38240 W38240 

118475 genbanK_N66845 K66845 

104787 genbartLAA027317 AAQ27317 

106055 genban)LAM17034 AA417034 

113702 genbanKJ97307 T97307 

30 101046 entezJ<D1160 KD1160 

101447 entTC2LM21305 M21305 

101624 entre*JVI55998 M55998 

124677 geiibanJLR01073 R01073 

110581 genbanXJ161560 H61560 

35 119023 genbanK_N98488 N98488 

110775 genban)LN22414 N224I4 

112092 genbanK_H44538 R44538 

112S3 senban)LP51B1B R51818 

107014 genoan)LAA6^8820 AA598820 

40 114988 genbanlLAA251089 AA251089 
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TABLE 11 : Figure 11 from BRCA 001-3 PCT 

5 Table 11 depicts a preferred group of genes upregulated in tumor tissue compared to normal 
breast tissue. 

Pkey. Unique Eos pnobeset Identifier number 
10 ExAcac Exemplar Accession number. Genbank accession number 
UnfeenelD: Unigene number 
UnfeeneTlfle: Unjgene gene fife 

R1: Ra&oftunwtarwrrnaJbooVSssue » 

Kt Ratb of GO^percentte tumor tommal body 

IS R3: Ra&ofT^peicentitenoriralbodyto 
R4: Ratfo of turror to nonrel breast fissue 
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Pkey ExAccn UnlgeneE) UnigeneTffie 



100131 D12485 Hs. 11951 

10)147 D13G66 Hs.136348 osteoWast specffic factor 2 (bsdcto 

100522 X51501 Hs£9949 pnrtacfin-lriduced protein 

100666 LD5424 Hs.169610 CD44 arrfigen {honing ftmcficn and Indian 

25 101104 AW862258 Hs.169266 MuropepfideY receptor Y1 

101478 NH_002890Hs758 RASp21 pfrtehacfivator(GTPaseac8va 

101724 L11690 Hsj620 buDouspernpruyAI anfigen 1 (230J240K)) 

101754 S70114 Hs239489 71A1 cybtodco^arwle-assodated RNA-bi 

101888 AL049610 Hs35243 trarecnpion etongafon tactor A (SU>- 

30 102165 BE313280 Hs.158627 associated protein 3 

102304 AP015224 Hs.46452 mamrnaglobii 1 

102348 U37519 Hs37539 aldehyde dehydrogenase 3 farnfly. rrerriber 

102457 NMJM1394Hs2359 dual spedfcBjr phosphatase 4 

102567 U63830 Hs.146847 TKAF family rrBrnberassodated NFKB activ 

35 102823 D85390 Hs5057 rarboocypepfidaseD 

103557 AL133415 Hs297753 vimentin 

103613 NNL000346Hs^316 SRY(sexde}smMgrep>nYHx]0(9(ca 

104115 AF163810 H&261Q2 orjpostestrarritotrfcrKJ^ 

104667 AE39923 Hs30098 ESTs 

40 104804 A18587Q2 Hs31803 ESTs, WeaWyslmtotorWA^>|Rsapten 

104807 AH39058 Hs.125790 leudne-rich rBpeatcrotaftng 2 

104896 AW015318 H&23165 ESTs 

104943 AF072873 Hs.114218 fiMed(I>osoprfe)r»rtK)log6 

105038 AW503733 Hs3414 WAA1488 piotefc 

45 105329 AA234561 Hs22862 ESTs 

105500 AW602166 H&222399 CEGP1 protein 

105516 AK001269 Hs30738 hypofte&al prutefo FU10407 

105730 AW377314 Hs5364 DKFZP564I052 protein 

106012 AI240665 H&8895 ESTs 

50 106095 AF115402 H&11713 E744ikefiac^5(e1s(fc)rratntrar^ 

106155 AA425414 Hs.33287 nuctear factor l/B 

107102 AB037765 Hs3C652 WAA1344 protein 

107136 AV661958 Hs3207 GKD01 protein 

107151 AW378065 Hs.8^7 ESTs 

55 107922 BE153855 Hs.61480 lg superfamDy receptor LN1R 

108339 AW151340 Hs51615 ESTs, WeaWys^ to ALU7J1UMAN ALUS 

109112 AW419196 Hs^57924 hypoffteSca) protefc FU13782 

109292 AW975746 Hs.188662 WAA1702 proteto 

109415 U80736 Hs.1 10826 trinudeofide repeat oorrfainmg 9 

60 109912 AW390S22 Hs301528 L-kyrairerBrffi&$^^ 

110009 BE075297 Hs.6614 ESTs, Weakr/sJrn3artoA43932rnucto2p 

110915 BE092285 Hs29724 hypo^eScaJ protein RJ13187 

111164 (446180 Hs.122489 Horrx>sap^(flMFU13289fis,(^OV 

210 



R1 


R2 


R3 


R4 


132 


244 


19 


9.9 


157 


1030 66 


5 


22.7 


760 


34 


1.4 


85 


85 


1 


32 


153 


153 


1 


14.1 


9.6 


96 


1 


65 


9.4 


94 


1 


03 


8.9 


89 


5 


6 


7.3 


73 


1 


53 


93 


93 


5 


8 


65 


2058 243 


1.4 


6.4 


428 


67 


23 


202 


202 


5 


13 


82 


82 


1 


63 


5.6 


56 


1 


53 


7.5 


136 


18 


3.4 


73 


73 


1 


52 


29 


290 


1 


263 


145 


149 


1 


6.4 


7.7 


77 


1 


5.1 


7 


70 


1 


65 


7.4 


74 


1 


6 


162 


162 


1 


42 


55 


55 


1 


52 


2.8 


131 


47 


3.9 


25.4 


508 


20 


3 


83 


83 


3 


13 


6.9 


69 


1 


4.4 


212 


212 


6 


17.4 


263 


356 


14 


1 


93 


483 


49 


13 


63 


63 


1 


5.4 


25 


392 


155 


43 


155 


156 


7 


103 


9 


90 


1 


55 


18.7 


187 


1 


17 


4.1 


334 


82 


3.4 


7.1 


71 


1 


65 


123 


123 


1 


113 


142 


142 


1 


95 


6.3 


693 


110 


72 


203 


209 


1 


195 


7.7 


77 


1 


5 



WO 02/059377 



PCT/US02/02242 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



11179 AKD00136 Hs.10760 aspork) (LRR class 1) 
1190 AKD02055 Hs.151046 hypofteiical protein FU1 1193 
1223 AASS2773 H&334838 WAA1866 protein 

11357 BE314949 Hs37128 hypofoetfca] protein RJ23309 

12244 ABQ29000 Hs.70823 KIAA1077 protein 

13047 AI571940 Hs.7549 ESTs 

13702 797307 Qbcye53h05^1 Soares fetal few spleen 

14124 W57554 Hs.125019 tyn^rfwWmKieaf prot8^(LAF^)mRNA 
14138 AW384793 Hs.15740 Homo sapiens mRNA; cDNA DKFZp434EQ33 (ft 
14768 AF212848 Hs.182339 ets homologous factor 
14880 AL157545 Hs.42179 bTOinodomaln and PHD anger corrtatohg, 3 
14965 AI733881 Hs.72472 BMP-K1B 

14988 AA251089 gfczs04f05.s1 NC1_CGAPJ3C81 Homo sapiens 

15206 AW183695 Hs.186572 ESTs 

15719 AW992405 Hs59622 Homo sapiens, done IMAGE3507281, mRNA, 
15844 AI373062 Hs332338 hypofcefcaJ protein MGC5370 
16470 AI272141 Hs33484 SRY (sexo^errnWng regtonYH»x4 
16786 HSB36 Hs301527 ESTs, Moderate^ sinto to unJaiown{Hj 
17280 M18217 Hs.172129 Homo sapiens cDNA: FU21409fe,doroC 
17412 N32536 Ha.42645 sotecanierfan%16(inonocarboKyfc 
8472 AL157545 Hs.42179 brorrodomah and PHD fingwcontahAig, 3 
19271 A10S1118 Hs,65328 Fanconl anemia, comptemenfafon group F 
19771 AB05687 H&2533 EST 
120562 BE244580 Hs302267 hypofteficaJ protein RJ10330 
21463 AKD00282 Hs239681 hypofteflcal proteh FU20275 
121723 AA243499 H&.104600 hypoftetol protein FU10134 
I22963 AA478446 Hs.69559 WAA1086 protein 
23137 A1073913 Hs.100686 ESTs, Weaty sinter to JBD350 Anterior 
23619 AA6Q2964 gbmo97c02s1 NC^CGAPJW Homo sapiens 

23709 AA706910 Hs.112742 ESTs 
24006 AH47155 Hs270016 ESTs 

24059 BE387335 H&283713 ESTs, WeaMy sim2arto S64054 hypofoeii 
24308 AA249027 Hs241507 ribosomal protein S6 
25279 AW401809 Hs.4779 K1AA1150 proteh 
25617 AA287S21 Hs.164950 ESTs 

27439 D60237 Hs.14368 SH3oVm^btna1nggtulanik;ack^ 

128305 AI954968 Hsl279009 matrct GJa protein 

128482 A1694143 Hs296251 prograrrinedcefideaM 

28790 AF026692 Hs.105700 secreted taled-related proteM 

[28925 R67419 Hs21851 HornosaptenscONARJt2900fis i done NT 

29017 AA115333 Hs.107968 ESTs 

29229 AF013758 Hs.109643 pctyartjnytatebtairgp^ 

29337 NMJM4918HS.110488 WAA0990 protefi 

29366 BE220806 Hs.184607 Homo saptens done 23785 inRNA sequence 

129821 ABG28945 Hs.12696 cor!ac&SH3*>ri)afe-b^ 

130036 BED61916 Hs.125849 chronBSon«8openrea*^6ame2 

30057 AFQ27153 H&324787 solute carrier famBy 5 Pnosltottraep 

I30095 AKD01635 Hs.14838 hypoftetelprofefri FU10773 

30343 AB040914 H&278628 KIAA1481 proteti 

30385 AW067800 Hs.155223 stanribcaWn 2 

30407 BE385099 Hs334727 nypomeScal protefo MGC3017 

130441 U63630 Hs.155637 protein tdhase, DNAacfivated. catalyfc 

30455 D90041 Hs.155956 ^^ace^*BKfefase 1 {arytarntneNacety 

30604 AA383256 Hs.1657 estrogen receptor 1 

30617 M90516 Hs. 1674 ghrtan^nictose^ihosD^ 

30712 AJ271881 Hs279762 bromoo^mi3kMSOfibinkig7 

31148 AW953575 Hs303125 p534nduced protein PIGPC1 

31388 NM.014810KSJ2200 KIAA0480 gene product 

131564 T93500 H&28732 Horno sapiens cDNAFUl 1041 fis, done PL 

31742 AA961420 Hs31433 ESTs 

31877 J04088 Hs.156346 topotonerase (DNA) D alpha (170K>) 

31985 AA503020 H&36563 hypofoefcal protein FU22418 

32316 028831 Hs.44566 K1AA1641 protein 

32528 T78736 Hs50758 SMC4 (structural nrintenanceofduornosD 

32742 AAQ25480 H&292812 ESTs, Weakryarn2artoT33468hypotf^ 

32990 X77343 Hs334334 transcrtpto«actorAP^atpta(edivat 

33015 AJ002744 HsJ246315 UDP-NnaceryMpha^ 
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133199 AF231981 Ksl250175 homotog erf yeast long chain polyimsatura 


3 


816 


275 


3.9 


133240 AKD01489 Ha£42894 ADMbosytattan facbu^ke 1 


8.1 


81 


1 


4.6 


133271 Z48S33 H&283742 Haptens mRNA for refcriransposon 


12.4 


124 


6 


105 


133640 AW246428 HsJ5355 uWr^rifiiKo^gaHng entire E2N(bomok) 


85 


85 


1 


72 


133746 AW410035 Hs.75862 MAD (motes agaif^decapentaptegk; Dr 


95 


93 


1 


7.8 


133939 AA53S244 Hs.78305 RAB2, member RAS oncogene family 


75 


78 


1 


5.6 


134110 U41060 Hs.79136 UV-1 proteh. estrogen regulated 


45 


1472 330 


2.1 


134485 X82153 Hs*3942 caihepsb K (pyenodysosiosb) 


345 


411 


12 


5.1 


134G54 AKD01741 H*8739 hypo8»e&al protem RJ10879 


6.4 


64 


1 


5.1 


134880 AI879195 Hs^90G06 15 KDa selenoprotein 


5.7 


57 
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135029 H58818 Hs.187579 hydraxysteroJd (17-beia) dehydrogenase 7 


115 


115 
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10 


135389 U05237 H&99872 fetal Abhekner antigen 


20.6 


206 
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19.1 


128305 AB54968 Hsu279009 matrix Gta protein 


9.4 
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TABLE 11A 



Table 11 A shows the accession numbers for those pkeys lacking unigenelD's for Table 1 1 . 
5 For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession'* column. 

10 

Ptey: Unique Bos probeset Mentffier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 

15 

Pfcey CAT number Accession 

123519 371681 1 AA602964 AA609200 
20 113702 genbankJT97307 T97307 
114988 genban)UU251089 AA251089 
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TABLE 12: Figure 12 from BRCA 001-3 PCT 



Table 12 depicts a preferred group of genes upregulated in tumor tissue compared to normal 
breast tissue. 



Pkey: Unique Eos probeset Henffiter number 

10 ExAcav Exemplar Accession number, Genbank accession number 

UnbenelD: Unlgene number 

UnfceneTtfle: Umgenegeneffite 

R1: Rate of tumor br>onral body tissue 

R2: Ratfctf90*percent3etujiwtobody 
IS R3: Rate of 75*percenEe body to tumor 

HA: Ra& of bimorbnorniaJ breast fesue 



20 



25 



Pkey ExAccn UnlgeneD Untgene Title 


R1 


R2 


R3 


R4 


100131 012485 Hs.11951 phosptoflesteiase 1 (PC-1) 


13.2 


244 


19 


9.9 


105500 AWS02166 Hs^22399 ESTs 


25.4 


508 


20 


3 


112244 AB029000 Hs7B823 WAA1077 protein 


SJ 


567 


100 


6.7 


114124 W57554 Hs.125019 ESTs 


245 


242 


10 


5.6 


119771 A1905687 Hs^33 ESTs 


ZS 


2073 


595 


2.1 


121723 AA243499 Hs.104800 ESTs 


Z9 


214 


74 


3.7 


128790 AP02GG92 Hs.105700 secreted frfezled-reteted protein 4 


17.4 


409 


24 


7.8 


131148 AW953575 Hs303125 ESTs 


33 


565 


153 


3.7 


131985 AA503020 HsJ36563 ESTs 


40.2 


402 


1 


4 


133199 AF231981 Hs250175 Homo sapiens done 239MmRNA sequence 3 


816 


275 


3.9 
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TABLE 13: Table 1 from BRCA 001-5 US 
5 Table 13 depicts a preferred group of genes upregulated in breast cancer cells. 



Pkey: Unique Eos probesdUeiiffiarninTibef 

10 ExAccn: Exemplar Accession number, Genbanx accession number 

UrogenelD: Ungene number 

UnfeeneTifle: UnfeenegenetifJe * 

R1: Ratb of tarnvvto normal body tissue 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



Pkey ExAccn UniGene 8) Unlgene Title 

100038 M97935 control 

100039 M97935 control 

100040 M97935 control 
1G0041 M97935 control 

100082 AB003103 Hs.4295 proteasoma (prosome; rracropain) 26S sub 

100091 AF000177 Hs.111783 Lsm1 protein 

100100 AFD06084 Hs.11538 acfin related protein 2Q complex; subunx 

100103 AF007875 Hs3085 ooflchyHitosphate mannosyttransferase p 

100114 D0Q595 Hs32962 Br/nifciytatesynftetase 

100121 D10495 Hs.155342 pioteh kinase C;deda 

100123 D10S23 Hs.168669 Qxoghrtarate dehydnxjenase (Gpoairode) 

100128 D11094 Hsj611S3 protBasorna {precoma: rracropaln) 26S sub 

100131 D12485 Hs.11951 phosphodiesterase lAnxieotide pyrophosp 

100137 D13627 Hs.15071 chapenwlnccflialr^Tr^ 

100144 013643 Hs.75616 Hun^mRNA^KlAA0018p^r)8;coirip 

100147 D136GS Hs.136348 ostaob^spedte factor 2 (fasckilnHto 

100154 D14657 Hs31892 KWA01O1 gene product 

100164 014812 Hs.173714 MORF-retatedgeneX 

10)169 D14878 Hs32D43 D123 gene product 

100190 D21090 Hs.178658 RAD23 (S. cerevlsae) homotog B 

100203 D2>538 Hs.172199 adenylate cyclase 7 

100205 026308 Hs.76289 bfflwrtDn reductase B (Bavh reductase {N 

100215 D26598 Hs32793 pp te asome (prosome; nracropah)subun8 

100216 026599 Hs.1390 pn>teasQfne(piDSon^rna£io|^ 
100219 028137 Hs.118110 bor£rrBrrowsbi^ceQanQgen2 
100227 D28915 Hs32316 hterfBronWuoed; hepamcassodated 
100248 031888 Hs.78388 KIAAC071 protein 

100287 D43950 Ks.1600 chapen^amtaWrgTT^;siJbw*5(e 

100294 D49396 Hs.75454 arrttoxldart protein 1 

100307 D50525 Hsj699 hypottedcal protein 

100335 D63391 Hs.6793 pfatetekriwating rato^ 

100340 D63487 H&82563 WAAD153 protein 

100355 078129 Hs.71465 Homo sapiens mRNA for squaleneepoxld 

100363 D78514 H&78563 ublaufflrKXinfugaeng enzyme E2G1(hom 

100368 D79987 Hs.153479 extrasi^palesacerevlslaeihcxrnto 

100372 D79997 Hs.184339 WAA0175 gene product 

100375 080004 Ks75909 WAAM82r*oteiri 

100379 082060 Hs378721 Ke4 gene; niouse; human homoJogof 

100387 D83777 Hs.75137 K1AA0193 gene product 

100393 D84145 Hs3S913 iwvelRGD<x)nta^r>roteh 

100398 D84557 Hs.155462 mmUtronxsome mairterarce deficient (rn 

100405 D86425 Hs32733 nWogen2 

100406 D86479 Hs.118397 AE4Mng protem 1 
100409 086957 H&80712 WAAQ2Q2 protebi 

100421 D86985 Hs.79276 Human rr^taKlAA0232gerre;coinp 

100446 087464 Hs.10037 WAA0274gene product 

100447 087465 Hs.74583 WAA0275 gene product 

100448 D87469 Hs37652 EGF-RkeKtomafru muffipte 2 
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100467 D89052 HsJ476 ATPase; rf* transporting; lysosomal (vacu 75 

100468 D89289 Hs.1 18722 fwxjsyttran^^8(aIpha(1^)fucosyftr 5 
100486 HT1112 Hs.10842 Ras-Uke Protein Tc4 163 
100497 HT1400 Hs.79137 Carboxyl MeaiyHransfaase, Aspartate, A 5.6 

5 100618 HT2710 Hs.114599 G3Bagen 1 TypeV« > Alpha 1 75 

100661 HT3018 Hs.132748 R3)osomal Proton L39Homolog 4.4 

100667 HT3127 Hs.169610 Epfcan, At SpBce 11 4.6 

100668 HT3938 Hs.169610 Epfcan, Alt Spfce 12 4.4 
100676 HT3742 Hs^87820 FBxonedkwAK.SpBcel 9 

10 100m KT26388 .Hs596Q3 Mucin 1, EpMat Ak. SpBce 9 47 

100783 HT4018 Hs.191356 Baste Tianscrip&n Factor, 44 KdaSubun 13.7 

100829 HT4343 Hs278544 CytosolicAcsetoace^enzymeAm 10.6 

100830 HT4344 Hs.4756 Rad2 55 
100840 HT4392 Hs.183418 Protein Kinase Pitslre, Alpha, AIL Sptice 4.1 

15 100850 HT417 H&297939 CaftepslnB 4 

100866 HT4582 Hs.75113 Transcripfon Factor Ifa 43 

100306 HT5158 Hs5398 GualKlS^5 , -^to^KJphosphateSy^^ 6.7 

100914 HT511 Hs324178 RastnhlbaDrtnf 72 

100916 HT544 Hs.73946 EndolheBal Cd Gronrth Factor 1 53 

20 100945 HTB84 Hs.180686 Oncogene E&AfcPapBtonwtrus 45 

100975 J02923 Hs.76506 lymphocyte cytosoBcprotefri 1 (L-ptestin 30.1 

100988 J03589 Hs.76480 ub*iuiflrHB®4 8.3 

100996 J03909 Hs.14623 toten^ganruhfoA^ 63 

100999 J03934 Hs30706 oTaphorase (NADH/NADPH) (cytDchrom 43 

25 101011 J04430 Ks.1211 acid phosphatases; tartrate resistant 53 

101017 J04599 Hs321 Wglycan 5.1 

101031 J05070 Hs.151738 matrix rrietaDoprote^ 372 

101038 J05249 Hs.79411 rep8r^protebA2{32H)) 6.1 

101054 K02405 Ks.73931 Hurm^dassDHLA^&betarrtRN 43 

30 101061 K03515 Hs.180532 glucose phosphate isomerase 43 

101091 L06132 Hs.149155 voftage^ependert anion channel 1 7.4 

101097 106797 H&89414 chernoWne (C-XC motS); receptor 4 (fus 4.6 

101104 LB7615 Hs.169266 Human nsuropepftteY receptor Y1 (NPY 183 

101143 L12723 H&90093 teat shock 70W>protefri 4 17.4 

35 101152 L13800 Hs38&4 Homo sapiens ttrer expressed protein gen 7.6 

101183 L19779 Hs.795 H2AWstDnefemfly; member 0 103 

101216 L25876 Hs34113 cycflwtependenltese tehfoBor 3 (CDK 7.4 

101233 L29008 HsJ78 sorb^aehydrogenase 145 

101247 L33601 Hs.78802 glycogen syntoeWnase 3 beta 75 

40 101282 138810 Hs.79387 proteasome (prosome; macropaln) 26S sub 4.4 

101326 L42572 Hs78504 ktnerrnetrijr^p(o^nmho(^(m 53 

101332 L47276 Hs.156346 Hon» sapiens (eel BneHL-6) alpha toco 183 

101346 L77213 Hs30954 phosprwrnevatonate kinase 75 

101352 L77701 Hs>16297 COX17(yeas0homolog;cytochro^ 93 

45 101378 M13755 Hs333 Herfsrawfimulated protetn;15kDa 18.1 

101395 M15796 HsJ8S96 prc«era^cel«ictearan^en 8.6 

101404 M16342 Hs.182447 hetetogefieousnuctearrttonuc^^ 45 

101439 M20902 K&268571 apojjwp rote fo 04 6.1 

101464 M22538 H&51299 NADHcfeftyoYoBenasef^^ 8.7 

50 101469 M22877 Ks.169248 Humaisorraflc<^tochrc<riec{HCS)p^ 42 

101472 M22960 Hs.118126' r^teotoprotBtotcfbeta^ 65 

101478 M23379 Hs.758 RASp21 pfotehacflvator(GTPaseactiva 14 

101484 M24594 H&20315 iriten%cf)4n^pi^en55 92 

101539 M30818 Hs326 myxovtrus(iri8uerrza)r^^ 5.1 

55 101540 M30938 Ks34981 Xray repair oarapbfnenflng defecfoerep 4.7 

101544 M31169 Human propbnyWoA cartweytasebetw 55 

101552 M31642 Hs32314 rrypcotanfliineprwsprBDrttx^ 85 

101580 M34677 Hs33363 DNA segnterrton ctoomoscroe X (unique) 45 

101600 M37583 Hs.1 19192 r^Wstor»tarn»f r member Z 5.7 

60 101663 M6075O KsJ2178 H2Bhistof»(ari}^ member A 5.8 

101664 M60752 Hs.121017 H2AWs*one1amilr. member A 135 

.101667 M60858 Hs.79110 micteoKn 4 

101684 M63256 Hs.75124 (^ebeDardeger^fion-retated protein (62 7.6 

101702 M64929 Hs.179574 protetophc^halase2(for^ 42 

65 101754 M77142 H&23M89 7W1 cytotorfcpaiiul^^ 45 

101758 M77836 Hs.79217 pyncfee-S^arboxyiate reductase 1 57 

101767 M81057 Hs. 180884 cartxoypepiJdaseBI (tissue) 21.7 
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106664 AA460969 Hs.7510 mitogervacfivated protein kinase kinase Id 

106698 AA463745 Hjl29403 ESTs; Weakly similar to PROBABLE AT 

106719 AM65171 K&236844 ESTs 

106726 AA465339 Hs.3888 ESTs 

5 106747 AM76473 Hs.171957 . tripfefuncBonai**^ 

106759 AA477263 Hs755B4 ESTs 

106765 AM77717 Hs306117 tnterteukm 13 recepton 1 

106784 AA478558 Hs^27913 AP&fltel 

106831 AA482014 HsJ»463 centrirr, ff-hand protein; 3 (COC31 yeast 

10 106836 AA482112.H&238707 ESTs 

106840 AA482548 H&5534 ESTs 

106856 AA486183 H&285123 ESTs; Weatysirrto to state to oxyste 

106865 AA487228 Hs.19479 ESTs 

106878 AA488872 Hs.12314 Homo sapiens mRNA; oDNA DKFZp586 

IS 106888 AA489101 H&24734 axystenA blndfcg protein 

106895 AA48S665 H&25245 ESTs 

106909 AA490323 rfe250747 SUM01 acfivaflng enzyme subunftl 

106919 AA490885 Hs^1766 ESTs 

106920 AA490899 Hs296323 ESTs 
20 106941 AM96204 Hs^37971 ESTs 

106942 AA496347 Hs31314 lefirmblastoma^imlirig protein 7 

106948 AM96788 Hs£1C77 WAA0532 protein 

106968 AA504631 Hs^6813 ESTs; WeaWysfrri&rtDhypo8iefical412 

106973 AA505141 Hs.11923 Human DrMsequerxs from done 167A1 

25 106980 AA521121 HsJ858 brornodortBha^cenltozinctogerckjm 

106981 AA521157 Hs.74101 ESTs 

106998 AA598461 Hs.195464 tosufotte growth fectorbfotfng proton 

107008 AA598710 Hs£5740 ESTs 

107028 AA599214 Ha24143 ESTs 

30 107032 AA599472 Hs^47309 succtoatfrCoA fease; OT-fcrrring; beta 

1070S2 AA600134 Hs.12482 glycerorephosphateO-acyto^ 

107063 AA600147 Hsl5741 ESTs; Vteak^sirn^toNADH^^ 

107056 AA600310 Hs.18720 propjanHiiedcelo^8(apopto5is^ttluc 

107080 AA609210 Hs.19221 ESTs 

35 107102 AA609723 Hs30852 ESTs 

107109 AA609943 Hs32793 ESTs 

107129 AA620553 KM756 ^structure^pec^Q)doriuc)ease1 

107132 AA620598 H$4Q52 ESTs 

107136 AA620795 Ha.8207 ESTs 

40 107140 AA620889 Hs.170088 ESTs 

107151 AA621169 Hs3687 ESTs 

107159 AA621340 Hs.10600 ESTs; WeaMy shnbr to ORFYKR081c[ 

W174 AA621714 H&25338 ESTs 

107217 D51095 Hs35861 DKFZP586E1621 protein 

45 107252 059971 HsJZSSZS ESTs 

107295 T34527 Hs30120 UDP^^aosiytelpha^^^ 

107299 T40327 Hs3Q661 lung resistance-related protein 

107324 T81665 Hs778422 DKFZP586G1122 protein 

1 07372 UB5625 Hs3297 ribonudease 6 precursor 

50 107373 U85773 Hs. 15469 5 presphofnawwrrartase 2 

107481 W58247 Hs27S766 HcfrcsapteralonBslnsuperfamilym 

107531 Y13936 Hs.17883 protetapr)0spha1as8l6(forrr^2CXma 

107859 AA024835 Hs.47584 rxfessiumvoftagB^atedchar^ 

107890 AA026030 Hs.61311 ESTs;WeaJdyslm3artoCALPAIN2;LA 

55 107908 AA026894 Hs.42826 ESTs 

108039 AA041341 Hs.46670 ESTs 

108040 AA041551 Hs.159971 ESTs 

108102 AA046424 Hs.49433 EST^V^simBartoHYParHETICA 

108217 AA058686 Hs.62588 ESTs 

60 106255 AA063157 Hs.172608 ESTs 

108358 AA071514 Hs.1634 ESTs 

108809 AA100694 Ks.69499 Hurr^Dr^sec^erK8fromBAC15E1 o 

108647 AA112396 Hs.44276 ESTs; Moderately simfiar to HOMEOBO 

108676 AA115562 H&274417 Rc™sapkrarr^cDMDKFZp564 

65 108687 AA120785 Hs£4347 ESTs 

108695 AA121315 K&.70823 KWA1077 protein 

108733 AA126422 zn84f1.s1 Stratagene king cardnoma 9372 
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108774 AA128125 Hs.71040 ESTs; Moderately similar to CELL GROW 

108828 AA131584 Hs273344 DKFZP56400463 protein 

108872 AA134063 Hs.111680 ESTs 

108884 AA134958 H&2S3591 ESTs 

5 103893 AA135894 Ha.194691 refooteacW Induced 3 

109008 AA156360 Hs.87128 ESTs 

109010 AA156460 Hs.44229 dual spedffcfly phosphatase 12 

109011 AA156542 Hs.72127 ESTs 

109042 AA159525 H&71779 Homo sapiens DMA bom chromosome 19 

10 109086 AA166695 ,Hs270737 tumor necrosis factor ($gand)supeifcmfty 

109090 AA1G70Q6 Hs.70499 ESTs 

109101 AA167708 Hs32184 ESTs 

109112 M169379 Hs257924 ESTs 

109160 AA179387 Hs301997 DKFZP434N126 protein 

15 109186 AA179845 Hs.73625 RAB6 hteracttog; WnesMte (rabkfoesin 

109178 AA181600 H&2B3707 ESTs 

109179 AA1819Q2 Hs.192789 ESTs; WeaWy sMar to Ml ALU SUBFA 
109261 AA135255 Hs31779 ESTs 

109270 AA195515 Hi3585 ESTs; WeaWy similar to afierrafively spfi 

20 109277 AA196332 Hs36043 ESTs 

109313 AA206800 Hs36276 ESTs; ModefateJyshto to zinc finger p 

109415 AA227219 Hs.110826 trwJcteofide repeat coritaWng9 

109454 AA232255 KsJ295232 ESTs 

109467 AA232S04 Hs331S7 ESTs 

25 109481 AA233342 H&289069 ESTs; Weakly shabr to WD40 protest C 

109508 AA2338S2 H&559Q2 ESTs; WeaWy simflar to !Ui ALU SUBFA 

109514 AA234Q87 H&262346 ESTs; Weakly shribr to ORF2: function 

109572 FO2027 H&171937 *" ESTs 

109632 F04165 Hs235873 ESTs; Weakly sirafer to K11C42 [Oeteg 

30 109844 F04477 H&291531 ESTs; Moderately sbnfar to GLYCERAL 

109703 F09684 Hs£4792 ESTs; WeaWy similar to ORF YOR283W 

109726 F10009 Hs3196 ESTs 

109747 F10161 K&22969 ESTs 

109799 F10770 Hs.180378 Homo sapiens done 669 unknown mRNA 

35 109814 F10979 Hs.153106 Homo sapiensdone 23728 mRNAsequen 

110189 H20543 Hsl6278 DKFZP586B1621 profe*) 

110240 H25577 Hs.176588 ESTs; WeaWy sknilar to CYTOCHROKE 

110280 H29285 Hs32468 ESTs 

110520 H56965 Hs.4082 yr09f06.s1 Scares fetal Ever spteen 1hR 

40 110561 H59617 Hs.5199 ESTs;WealdysWlartoUBKWriKCO 

110707 H95079 Hs.15617 ESTs; WeaWy sNbr to Jill ALU SUBFA 

110734 H98714 Ha24131 ESTs 

110770 N22262 Hs.131705 ESTs 

110780 N23174 H&22891 sc4ito canter famDy 7 (cafionfe amino act 

45 110787 N24716 Hs.12244 ESTs; WeaWy simfar to C44B9.1 fCeteg 

110794 N25262 H&27S31 ESTs 

110799 N26101 Hs323401 Human rfogzmc^nnw parish (ZNF127- 

110818 N29454 H&27552 ESTs; WeaWy amBar to putafivep150|H 

110839 N30856 H&3Q246 sotote carrier fantfy 19 (Bdarrtf^ 

50 110844 N31952 Hs.167531 Homo sapiens niRr^lullengm Insert cD 

110854 N32919 Hs27931 ESTs 

110856 N33063 ESTs; WeaWy similar to S164 [Usaptens 

110860 M33436 Hs.170055 ESTs 

110897 N39148 Hs3880 DKFZP434D156 protein 

55 110915 N46252 Hs29724 ESTs 

110935 N48787 Hs305979 DfDb^lrihQte1(anfrelastaseka^ 

110970 N51374 Hs36870 Homo sapiens rnRNAfoOlengttitreertcD 

111008 N53375 Hs.166146 Hoir^iieuror^Bnmeo3ateeaTV9ene;3 

111008 H53388 HsJ222 ESTs 

60 111018 N54067 Hs3628 nTtogen-aeflvated protein kinase kinase ki 

111084 N59543 Hs.15456 POZ dwnain containing 1 

111100 K62522 Hs20450 ESTs 

111125 N63823 Hs269115 ESTs 

111132 1484378 Hs33293 ESTs;H^sln^tounkr^fiffK8on 

65 111139 N64683 Hs290943 ESTs 

111164 N6S857 Hs.14809 ESTs; WeaMydrnfer to D» ALU CLASS 

111172 N67102 Hs21851 Homo sapiens mRNA; cDNA DKFZp586 
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111178 N67227 Hs24633 ESTs 

111179 N67239 Hs.10760 ESTs 

111181 N67278 H&171802 ESTs; WeaWy sfoiflar to hypotteflcal pro 

111184 N67437 HsJ£43901 Homo sapiens mRNA; cDNA DKFZp564 

S 111221 N68869 Hs.15119 ESTs 

111223 N68921 H&297939 ESTs; WeaWy sfmflarto neogenki (Rsap 

111229 N69113 Hs.110855 ESTs 

111241 N69514 Hs^88880 ESTs; Weakly simflar to CG1-S2 protein [ 

111268 W70481 Hs£611B Homo sapiens done 24766 mRNA sequen 

10 111295 N73275. Hs21275 ESTs; WeaWystmSartoubtquffin-cof^ug 

111299 N73808 Hs^4936 ESTs 

111336 N79565 Hs^9894 ESTs 

111357 N91Q23 Hs.87128 ESTs 

111370 NS2915 Hs.94631 brefekfin A-mhlbited guanine nucleotides 

IS 111806 R33468 H&279008 ESTs 

11182S R3S885 Ksl286148 stroma* antigen 1 

111836 R36228 H&25119 ESTs 

111890 R38678 Ha.12365 ESTs 

111923 R39995 Hs.25925 Homo sapiens done 23860 mRNA sequen 

20 111942 R40576 Hs£1590 ESTs 

111987 R42036 Hs.6763 WAA0942 protein 

112101 R447S3 Hs^96341 adenytyi cydase^ssoriated proteki 2 

112134 R46025 Hs.7413 ESTs 

112197 R49482 Hi5637 ESTs 

25 112244 R51309 Hs.70823 WAA1077 protein 

112253 R51818 Homo sapiens mRNA; cDNA DKFZp566 

112305 R54822 Hs36244 ESTs 

112449 R63802 Hs.124186 ring finger protein 2 

112483 R66534 Hs285885 ESTs 

30 112519 R68631 Hs.11861 ESTs 

112610 R79392 Hs33643 ESTs 

112693 R88741 Hs31065 ESTs; Moderately simfiar to profiferatJon 

112751 R935C7 Hs.8207 ESTs 

112801 R97486 Hs.157160 protein Wnase; DNA^cfivated; catalyfic p 

35 112869 T03313 Hs.4747 dyskeratosis congenfe 1;dyskerm 

112871 703352 Hs.12285 ESTs 

112908 T10065 Hs3530 TIS-asscxia^senne-arginiiepfD^ 

11266 T17119 Hs.102548 glucocorticoid receptor DNANndfngfact 

112971 T17185 Hs33883 ESTs 

40 112995 T23528 Hs.7155 ESTs; WeaHysimSar to 7YN protein [M 

113047 T25867 Hs.7549 ESTs 

113075 T34660 Hs.6986 ESTs; WeaMy stmBar to till ALU SUBFA 

113117 T47819 Hs.159153 ESTs 

113206 T58044 K&241471 ESTs;McderatelyslmftartoDUALUSU 

45 113248 T63857 yc16eU1 aratagene lung ^3721) Homo 

113260 TS4896 Ks37420 ESTs 

113277 T65797 Hs.11774 pmtem tjpepfioV4»oy cfcftrarefeomeras 

113278 T65602 Hs.11135 yc11h10.s1 Strategene hmg (S937210) Ho 
113440 T86121 Hs.191445 ESTs 

50 113523 T90037 HS35549 ESTs 

113604 T92735 Hs£96083 ESTs 

113702 T97307 ESTs; Moderately stmBar to til! ALU SU 

113783 W19222 Hs.7041 ESTs; Weakly similar to UU ALU SUBFA 

113794 W37382 Hs.11090 ESTs 

55 113808 W44735 Hs5286 ESTs 

113811 W44928 H&6994 ESTs 

113822 W473S) Hs.17466 reflncfcaddrecertoresponderfjazaro^ 

113823 W47388 H&55099 rab6GTT^acflva^ pmtB^(GAPand 
113836 W56792 Hs.12040 ESTs; Weatiy srntfar to WAA0881 prote 

60 113857 W65477 H&5297 HomosapJeremR>^cONADKFZi)564 

113886 W72471 Hs23920 ESTs 

113895 W73738 Hs.12921 ESTs 

113923 W80763 Hs.3849 ESTs; WeaMy similar to FK5084Mngp 

113931 W81205 Hs3496 ESTs 

65 113950 W85765 Hs3Q504 Homo sapiens mRNA; cDNA DKFZp434 

113970 W66748 Ks.6109 ESTs 

114051 W94942 Hs.177534 dual specificity phosphatase 10 
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114799 


AA159323 


Hs.109929 


ESTs 


4.2 


114804 


AA16G363 




ESTs 


4.8 


114811 


AA161161 


Hs.95907 


mulfijplB inositol polyphosphate phosphate 


7.1 


114821 


AA165313 

on iw*/v IV 


H&55468 


ESTs 


4.4 


114852 


AA235D35 


Hs38260 


ESTs; Moderately sfrnflarto tiHgitttrn spa 


5 


114901 


AA236276 


Hs.196437 


ESTs; Weakly stmBar to R26660J; parti 


165 


114902 


AA236359 


Hs39504 


ESTs 


5.1 


114940 


AA243012 




ESTs 


8.5 


114965 


AA250737 


Hs.72472 


ESTs 


35.1 


115047 


AA252627 


Hs32916 


ttomeo hoot B5 


5.7 


115(64 


AA252863 


Hs37729 


ESTs 


62 


115061 


AA253217 


Hs.41271 


ESTs 


13 


115082 


AA255557 


Hs.198269 


NADH dehydrogenase (ubiquinone) 1 alp 


282 


115116 


AA256486 


Hs.62275 


ESTs 


83 


115140 


AA258030 


H&279938 


ESTs; Weakly similar to supported by 6E 


4.1 


115205 


AA262470 


Hs384216 


ESTs 


83 


115206 


AA262491 


Hs. 186572 


ESTs 


5.1 


115239 


AA278650 


Hs.73291 


ESTs; Weakly stouter to stmflar to the bet 


4.6 


115242 


AA278755 


Ks283732 


ESTs 


83 


115249 


AA27&961 


Hs.71124 


ESTs 


10.1 


115259 


AA279071 


rfe/13453 


spfclng factor 3b; subunit 1; 155kD 


9.5 


115285 


AA279799 


N&293736 


ESTs 


53 


115291 


AA279943 


Hs.122579 


ESTs 


5.1 


115357 


AA281793 


Hs.72988 


ESTs 


5 


115377 


AA282247 


Ks.193063 


ESTs 


6.1 


115400 


AA283198 


Hs39113 


ESTs 


4.9 


115439 


AA284561 


Hs.193090 


ESTs 


53 


115471 


AA287138 


Hs£3346 


ESTs; Weakly simflar to ASPARTYL-7R 


H7 


115506 


AA292537 


Hs.45207 


Human DNA sequence ftom done 62DE1 


63 


115522 


AA331393 


Hs.47378 


ESTs 


53 


115572 


AA398392 


Hi59594 


ESTs; Weakly stater to F33G 123 gene p 


9.7 


115587 


AA399264 


Hs£83G37 


ESTs; High* sbwbrtoHSPC039 protein 


8.7 


115600 


M400247 


Hs.42173 


ESTs 


4 


115612 


AA400948 


Hs.71243 


ESTs; WeatystoiOartozfcc finger prate 


8.4 


115646 


AA404352 


Hs305971 


ESTs 


53 


115652 


AA405098 


Hs38l78 


ESTs 


16.1 


115657 


AA405620 


H&55158 


ESTs; Wea% simBar to weak sbnflanTy t 


4.7 


115658 


AM05625 


Hs.183056 


Human DNA sequence ftom done 34B21 


5.1 


115675 


AA406546 


Hs32D65 


Homo sapiens mRNA; cDNA DKFZp564 


205 


115721 


AM17102 


Hs3Q960 


ESTs 


43 


115763 


AA421560 




ESTs 


7 
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115764 AA421562 Hs31011 anterior gradient 2 ptenepus taevis) homo 

115835 AA428576 H&41371 ESTs 

115844 AA430124 Hs.7773 ESTs 

115675 AA433943 Hs.43946 ESTs; Weakly sinBar to Weak similarity 

5 115888 M435839 rte.76591 KIAA0887 protein 

115922 AA441911 Ha.71869 ESTs; Weakly simBar to WAA0926prote 

115941 AA443602 Hs.46679 ESTs 

115947 AA443793 Ha54761 ESTs 

115948 AA443798 Hs.43445 poly(A>«pec^r&xmuctoa68(dead8^ 
10 115951 AM43918 Ha301048 coffin 1 (rawnuscte) 

115967 AA446887 Hs.42911 ESTs 

115984 AA447687 H&91109 ESTs 

116009 AA449448 Hs.44238 ESTs 

116024 AA451748 Hs33863 Human DMA sequence from done 718J7 

IS 116028 AA452112 Hs.42644 Ihtoretadnte 

116050 AA453656 Hs38417 ESTs 

116097 AA456099 Hs.176376 ESTs 

116108 AA457566 H&28777 ESTs 

116121 AA459254 Hs.48855 ESTs 

20 116127 AA4597Q3 H&279884 WTtycav^myelGcytomat^ vtralonco 

116129 AA459956 Hs.49163 ESTs; Hi^sim3artoputaflveribonude 

116142 AA460649 Hs39457 ESTs 

116204 AA465701 Ks.108646 ESTs 

116221 AA478397 Hs50180 ESTs 

25 116222 AA478415 Hs39986 ESTs 

116238 AA479362 H&47144 DKFZP586N0B19 protein 

116246 AA479961 Hs>250646 ESTs; Highly simDar to ub^itirKanjuga 

116249 AA480886 K&66693 ESTs 

116250 AA480975 Ha44829 ESTs 

30 116254 AA481146 Hs.41086 ESTs; Weakly similar to OXYSTEROl-B 

116256 AA481256 Hs382D1 ESTs; Weakly sfciar to rysophosphoCpa 

116264 AA482594 Hsl272239 Homo sapiens mRNA;cDNADKFZp586 

116265 AA482595 Hs55189 ESTs; WeaWy simfer to F25B53 [£eteg 
116282 AA486550 Hs204501 ESTs; WeaWy sfnto to Wlskc^AJdrfch 

35 116298 AA489046 Hs34109 ESTs 

116300 AA489194 Hs.159471 ESTs; WeaWy simBar to snRMP protein B 

116327 AA490959 H&28005 Homo sapiens rnf^cONADKFZp564 

116334 AM91457 Hs.48948 ESTs 

116337 AA4961Z7 Hs.44070 ESTs 

40 116351 AA504116 H&32501 HonjosapJersrr^cDNADKFZp434 

116357 AA504806 Hs30797 Homo sapiens clone 23620 mRNAsequen 

116415 AA609204 Hs27973 KIAA0874 protein 

116443 AA620313 Hs.190488 ESTs; WeaWy sfanDar to KERATIN; TYP 

116470 C13992 H&33484 ESTs 

45 116480 C14088 p^cefakfehyo»3i)hosphate dehydrogena 

116578 D51272 Hs.75337 nudedarphosphopftdeinp130 

116579 D51276 Hs31915 leukemia^ssodaied phosphoprotEtn P 18 
116826 FQ2Q28 Hs51907 ESTs 

116647 FO3069 Hs.15395 ESTs; Weakly sm2ar to ARGINYL-TRN 

50 116674 F04816 Hs32127 ESTs 

116680 FD8813 H&273829 UNEretn^ransposabfeeEenientl 

116700 F09983 H&317589 ESTs 

116724 F13665 Hs55641 ESTs 

116726 F13681 H&53913 ESTs 

55 116732 F13779 Ha.165909 ESTs 

116734 F13789 Ha33796 DKFZP586D2223 protein 

116760 H11054 Hs.155342 proteto ktoase C; ob&j 

116780 H22566 Hs30098 ESTs 

116786 H25836 Hs301527 tumornecrosb factor (Bgartd) superfaraSy 

60 116787 H28581 Hs.15641 ESTs 

116790 H29532 Hs.101174 mterotutwle^ssociated protein tau 

.116803 H47357 Hs.109701 ESTs; ModerateV simDar to weak shnSar 

116877 H68116 Hs.168732 ESTs 

116921 H72948 Hs321 btyycan 

65 117216 N20083 Hs.42792 ESTs 

117232 N20579 Hs.61153 ESTs 

117284 N22162 Hs.183779 ESTs; WeaWy similar to cONA EST yk33 
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117344 N24048 


Hs210706 


ESTs 


7.4 


117367 N24954 


Hs.42502 


ESTs 


103 


117382 N26175 


K&93405 


ESTs 


53 


117394 N26257 


Hs39S71 


KIAA0727 protein 


8.4 


117412 N26722 


Hs.42645 


. ESTs 


18.1 


11749B N31726 


Hs.44268 


ESTs; Highly slmHar to myelin gene expr 


5.8 


117557 N3392Q 


Hs.44532 


dlubkyjltin 


123 


117634 K36421 


Ks.13323 


ESTs; WeaMy sfrt5ar to SODIUM- AND 


4.4 


117639 N36923 


Hs.44833 


ESTs 


6 


117754 N47469 


H&59757 


ESTs 


73 


117852 N49408 


Hs.136102 


KJAA0853 protein 


S3 


117879 K50050 


Hs303Q25 


ESTs; Weakly sfon&ar to keratin; 67Ktyp 


73 


117924 N51066 


Hs38891 


ESTs 


7.9 


117950 N51394 


Hs.75478 


WAA0956 protein 


5 


117992 N52000 


Hs.172089 


Homo sapiens mRNA; cONA DKFZp586 


7 


118138 N57773 


Hs.93560 


ESTs; Weakly staffer to trg (Rjwrveglcu 


4.6 


118215 N62195 


Hs.77910 


34rydnacy-3-me£iylg!uta^^ 


13.4 


118229 N62339 


Ha.166254 


heat shock 90kD protein 1; alpha 


5.4 


118265 NS2827 


Hs.46645 


EST 


A2 


118336 N63604 


Hs.47166 


ESTs 


72 


118363 N64168 


Hs.48938 


ESTs 


6 


118429 N66158 


Hs.74649 


ESTs 


4.1 


118470 N66769 


H&291033 


ESTs 


5.4 


118472 NG6818 


Hs.42179 


ESTs 


108 


118475 N66845 




ESTs; WeaHy similar to )!!! ALU CUSS 


43 


118493 N67149 


H&50115 


ESTs 


53 


118528 N67889 


Hs.49397 


ESTs 


10.4 


118542 N68010 


Hs.49427 


ESTs 


7.9 


118600 NG9222 




ESTs 


9.2 


118695 N71781 


Hs30081 


Homo sapiens mRNA fuB length Insert cO 


9.8 


118698 N72113 


Hs30187 


ESTs 


43 


118901 N90719 


H&94445 


ESTs 


ai 


118952 N92966 




ESTs; Highly simSar to CGI-90 protefn [H 


123 


118976 N93629 


Hs.93391 


ESTs 


5 


118986 N94362 


Hs.125830 


ESTs 


73 


118989 N94439 


Hs.45105 


ESTs 


B2 


1190Z7 N99256 


Hs.114611 


ESTs 


5 


119042 R05316 


Hs-5472 


ESTs 


4 


119075 R36451 


Hs\287820 


fibronecfin 1 


6 


119260 T15916 


Hs.102950 


ESTs; Highly similar to coat protein gamm 


4.1 


119Z71 T16387 


Hs,65328 


ESTs 


1Z1 


119298 T 23820 


Hs.155478 


cycfinT2 


5.6 


119302 T25725 




ESTs 


143 


119341 T62571 


Hs.146388 


rnkTDtubuto-dssodated protein 7 


4 


119495 W3S390 


H9L55533 


ESTs 


53 


119580 W42451 


Hs.92260 


Wgh-mobfKy group proteh2-Gte 1 


5.6 


119602 W462B6 


Hsl233694 


ESTs; Weakly similar to ZK1058.5 (Cele 


63 


119620 W47620 


H&56009 


Z-ffofigoadenytate synthetase 3 


8.1 


119676 W60473 


Hs57787 


ESTs 


S3 


119717 W69134 


Hs57987 


ESTs 


4.6 


119729 W69747 


HsS4806 


WAA1062 protein 


4 


119805 W73788 


Hs.43213 


ESTs 


4 


119859 W80702 


H&38461 


ESTs 


4.8 


119867 W80852 


Ks^50696 


KDEL (Lys-Asp-Gto-Uu) encoplasmte re 


42 


119873 W81129 


Hs.44865 


Honx>sap^mRNA;cDNADKFZp588 


A3 


119899 W84767 


Hs38698 


ESTs 


S3 


119940 W86779 


K3272531 


DKFZP586BQ319 protein 


9 


119943 W86835 


Hs.14158 


copinelD 


4.8 


119970 W87812 


HsS3581 


Homo sapiens mRNA; cDNA DKFZp5B6 


4 


120131 Z38656 


Hs.75887 


ooatomer protein complex; subunft atyha 


A2 


120150 Z39549 


Hs.153746 


ESTs 


11 


120206 Z40805 


Hs.91668 


ESTs 


8.2 


120241 Z41815 


Hs.65946 


ESTs 


153 


12QS5 AA169752 


H&5672 


ESTs; WeaMy sirnter to StmDaifly to Yea 


A2 


120314 AA194166 Hs^21040 


WAA1038 protein 


6.8 


120325 AA195651 


Hs.104106 


ESTs 


15.2 


120352 AA211400 Hs.193172 


ESTs 


6.8 
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120428 
120524 
120528 
120571 
120649 
120655 
120668 
120712 
120713 
120724 
120873 
120865 
120919 
120948 
120989 
120977 
121103 
121291 
121320 
121463 
121596 
121723 
121748 
122125 
122522 
122655 
122704 
122782 
122856 
122882 
122928 
122974 
122997 
123016 
123107 
123111 
123114 
123136 
123137 
123169 
123176 
123338 
123436 
123442 
123449 
123494 
123503 
123533 
123619 
123673 
123729 
123819 
123960 
124000 
124006 
124012 
124021 
124049 
124059 
124243 
124308 
124314 
124315 
124350 
124352 
124357 
124390 



AA236822 

AA261852 

AA262107 

AA280738 

AA287115 

AA287347 

AA287833 

AA292654 

AA292655 

AA293470 

AA358015 

AA365515 

AA381125 

AA397822 

AA398116 

AA398155 

AA398936 

AA401753 

AA403008 

M411745 

AA416740 

AM19622 

AA421171 

AA434411 

AA449444 

AA454756 

AM56326 

AA459894 

AA463740 

AA465381 

AA476578 

AA478625 

AA479295 

AA480103 

AA486071 

AA486273 

AA486407 

AA407449 

AM87468 

AA488692 

AA489020 

AA504249 

AA598714 

AA5988C3 

AA598899 

AA599786 

AA600121 

AA608751 

AAG09200 

AA609471 

AA609778 

AA620636 

AA521785 

D57317 

D60302 

D8Q240 



H&173694 

Hs.192905 

K&104413 

Hs34892 

Hs.192843 

HsJ238205 

H&2S2913 

Hs.102506 

H&96557 

Hs.100747 

Hs301872 

H&301444 

Hs.104650 

Hs.129206 

H&97600 

Hs37697 

Hs.8166 

K&301927 

H&239681 

Hs.174104 

Hs.104800 

H&234545 

Hs.98806 



F10523 
F13673 
H66710 
H93575 
H94677 
H94892 
N21359 
N21626 
N22401 
N29325 



Hs37837 

Hs39445 

Hs39472 

Hs.75367 

Hs.108812 

Hs.101840 

Hs.194215 

Hs.106290 

H&323231 

Hs.104207 

Hs.191721 

Hs.129928 

Hs.194024 

Hs.100686 

Hs.69233 

Hs.187585 

Ks223014 

Hs.111496 

Hs.112493 

Hs.112110 

Hs393156 



Hs.158549 

r&278672 

Hs.112264 

H&287733 

H&74861 

Hs#0016 

Hs341471 

H3.13974 

Hs.74519 

Ha283713 

Hs.133525 

H3241507 

H&215766 

H&288757 

Hs.101282 

Hs.102406 

Hs7535 



K1AA1097 protein 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

eukaryotic translation Inflation factor 2 a) 

ESTs 

ESTs 

EST 

ESTs; Moderately similar to till ALU SU 
ESTs 

ESTs; H&Wy similar to simflaf to mago n 

casein kinase 1; gamma 3 

ESTs 

EST 

king cancer candidate 

T-ced receptor; alpha (V;D*,J;C) 

ESTs; WeaWy similar to K1AA0554 prate 

ESTs 

ESTs; Weakly similar to Mouse 195 mRN 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

Sro&e-adapter 

ESTs; WeaWy simitar to B00415 [C.eteg 

ESTs 

ESTs 

Ketch nxtif containing protein 

ESTs; WeaWy stmBar to aftemaflreJy spQ 

ESTs 

ESTs 

ESTs; ModefateJy simBar to K1AA0454 p 
ESTs 

ESTs; WeaMy sinrabr to secreted cement 
ESTs; Weakiy sfrnfer to Gag-Pol pofypro 
ESTs 
ESTs 

protease; senna; 15 
ESTs 

Homo sapiens mRNA; cDNA DKFZp564 

ESTs 

ESTs 

ESTs; WeaWy stmtorto I01ALU SUBFA 

ESTs 

ESTs 

rnembrane cornportent; chiomosome 11; s 
ESTs 

rn6thylrikUQfkkto€6nu^ 

ac<rvatedRhl\pofyrnGfasen(ranscn^ 

ESTs 

KUM5G11A Human fetal brah (TFuJhva 
ESTs 

prtmase; polypeptide 2A (58kD) 

ESTs 

ESTs 

Hwnosaplerorrtf?h^cDNADKFZp564 
GTP-WnaTng protein 

wal simian teukema viral oncogene hom 
Homo sapiens mRNA; cDNA DKFZp434 
ESTs 

yw37g07.s1 Morton Fetal Cochlea Homo 
ESTs; HjgWy simitar to COGWtte place 
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124438 N40188 Hs.11090 ESTs 95 

124447 N48O0O Homo sapiens mRNA;cDNADKFZp585 43 

124457 N50114 Hs266175 ESTs 6.1 

124539 N63172 Hs.146409 cefl dwision cycle 42 (GTP-binding prote 55 

5 124826 N74604 Hs.11090 ESTs 128 

124632 N79515 Hs306117 Werteukln 13 rec^tor; alpha 1 6.4 

124644 N91279 Hs.109654 ESTs; Ntodefately similar to outer membr 83 

124676 R01037 Hs.181013 phosptoojycerate rmrtase 1 (brato) 123 

124677 R01073 ESTs; WeaWysfrriftar to MAW CLASS 5.4 
10 124724 R12405 Hs.112423 Homo sapiens mRNA; cONA DKFZp586 6.6 

124773 R40923 Hs.105604 ESTs 4.9 

124777 R41933 ESTs 72 

124792 R44357 Hs.48712 ESTsWeaWysfrtotocDNAESTEJiffi 8.6 

124857 R63GS2 Hs.137190 ESTs 43 

IS 124911 R88982 Hs.180612 ESTs 4.7 

124955 T10598 Hs324841 ESTs; Weakly simBar to IUI ALU SUBFA 4.4 

124958 T11134 Hs.431 murine leukemia viral (bmMJowjgeneh 126 

125038 T78089 Hs270134 ESTs 4.1 

125092 T92544 Hs.137548 C084 anfigen (teutocyte anfigen) 14.8 

20 125132 W15495 Hs.129781 chromosome 21 open reading frames 6.7 

125144 W37999 Hs24336 ESTs 4.8 

125154 W38419 ESTs 53 

125243 W86423 Hs.105413 ESTs 6.6 

125279 W93640 Hs.4779 ESTs; Moderately slmSar to similar to AD 5.8 

25 125299 Z39436 Hs.102720 ESTs 122 

125303 Z39821 Hs288193 ESTs 102 

125304 Z39833 Hs.124940 GTP-btodTng profeh 6.8 
125474 AA151216 Ks.75103 tyros** 3wnoww)tygenaseftryptophan5fli 8 
1S509 AA044232 Hs288967 ESTs 5.4 

30 12580 AA126504 Hs267812 sorflng nexin 4 4.1 

1255B2 AA507383 H&74649 cytochrome CQJoJasesubunftVIc 115 

.125670 AI432621 Hs32685 CD47 antigen (Rh-related antigen; irifegri 4 

125698 AA748483 Hs.191356 geneialtjarecripfe* 9.4 " 

125745 AI283493 Hs.75722 ribopnorkill 62 

35 125852 H09290 Ha.76550 Hcn» sapiens mR^cONADKFZp564 253 

125972 AA434562 Hs35406 ESTs 4.1 

126160 N90960 Hs265398 ESTs;WeaWystmlartotrarBfoimafioiH 1&4 

12KS7 N99638 Hs.124084 tunwrneaosfc factor rece^ 95 

126337 AI066486 Hs.40500 simHarb&cerevislaeRERI 55 

40 126405 U46Z78 Hs.122489 ESTs 75 

125537 W40262 Hs.146310 ESTs; WeaWy sknUarto putafive p150 [H 4.1 

125590 W78968 Hs.181307 H3 Wstone; femfty 3A 4.5 

125712 AA2D5862 Hs.7942 ESTs 5.2 

126721 T72569 Ha.125359 Thy-1 eel! surface antigen 4.4 

45 126764 AB34393 Hs.102178 ESTs 4.6 

126804 AG03334 Hs.160628 ESTs 117 

125819 AA305538 H&279607 ESTs 4 

126877 A1052047 Ks261Q2 ESTs 7 

126991 R31652 Hs.821 bgtycan 5.6 

50 127479 AA513722 Hs.179729 colag^^X;atpha1(Sclvr^metoph 143 

127514 AA82G926 Hs204214 ESTs 45 

127663 W07286 Hs.10340 ESTs; WeaWystmBar to weak simflarfyt 5.1 

127677 AA916752 Ks264190 ESTs; Highly simBar to rtf=M3 [Mjtuscu 173 

127814 AA761755 Ks.136713 ESTs; Weakly s&nBar to V4-1 ptsaptens 4.1 

55 127S97 AG81549 Hs311054 ESTs 5.5 

128092 AA904617 Hs.166229 ESTs 53 

128218 KQ2682 Hs292154 ESTs;l^enileVstotDartorecomtma&) 53 

128466 D59653 Hs241471 EST 7.4 

128482 U83908 K&296251 programmed ceS death 4 53 

60 128517 AA280617 Hs.100861 ESTs; Weakly sknter to pK)katanh{H3 8.3 

128530 AA504343 Hs.183475 Homo sapiens dona 25061 mRKAsequen 65 

128559 AA226801 Hs.101448 mefastosb associated 1 52 

128574 AA412048 Hs38260 keratin 8 5.1 

1285% U31875 Hs.152677 start-chain ateohol dehydrogenase famOy 27.1 

65 128610 L38608 Hs.10247 activated leucocyte cell adhesiro molecule 132 

128629 AA399167 Hs.102708 DKFZP434A043 profieh 6.7 

128649 AA142853 Hs.103106 Hoirto sapiens mRNA for G7bputem(G 45 

229 



WO 02/059377 



PCT/US02/02242 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



128651 


AM46990 


Hs.103135 


ESTs 


6.1 


128&3 


R48943 


Hs.10315 


sohrti? cBntef femSy 7 (cstionic flinlno 3ci 


4.4 


128856 


AM58542 


1 Mb 1 WirV 




143 


\£.OI If 


"nflR17 


no. 


Mrvr» sarins rrfiNA* ffMA DKT7r£R8 


245 


400797 






InmKt IrinfKfi 1 /n rwitotn IvmsInA trrnoco\ 
. JaiUP KHWaw i \a |NUkbi ijtLwnto WHaa^ 


7.3 




N4930& 


■ Hi IWww 


ESTs: Weakh shnlar to alnha IfXVUll c 


92 


IZOfOl 


AOOJf £ 


nS.llDW3 


smsfl nucfeflr ribonurieoprokJn poiypcpti 


fid 


128793 


VfVdOOZ 


l4o 10571Q 




AR 




VT IamCO 


Via 4ftfi3Sft 


null v oo|Mao iib\iv\ wiw unrt \Ktoo 


4 


t/Do40 


AnAOOOOu 


lu 4neAQ 
rtS. IWrra 


bascfBBfltniBnibiaiifrjRducBdgsnB 


RQ 






iO> IUQI / O 


■ imiiu souitsia mvm mi uukuitc wdc^n 




1 on two 


AAonofvw 




pCTc» UImMv dmfar tn URIUPn01R7 IU 


fii 


4.OQQ0C 


UolUrO 


Uq04A51 


nufiAJ «kl|Jm3R> imw, ulnvi m\r£pooo 


64 

**" 


IHKUO 


a aa 4 none 


U» 4rtTOfiA 

ns. lur aou 


pcx<s 

COlo 


7 




Idoooro 


ns.lUr 0 10 


AjiIUIojUIc ik>4iAjiiuuAjfjfCf uao IKjlMlJCIIIIl 


59 


128946 


AAAoodo? 




proteflsoros Qvosomoc maoopam) sibunft 


U.i 


12B90O 


cartoon 
P1029U 


ns.lo0ow 


1 liintj > MnmM «Luui mDUA fAmi/ui 

nOnK) sapno uonu oo mnNA Scqucn 




129005 


AA4O0049 


LU 40001 

nS.1 JJxJ 


CCTe* Wnabtir otmlHr In CDTtft 1VI_ Akin 




129009 


AA1 31421 


LL> *7ECA7 

n3»7ooU/ 


bo IS 


0 ft 


129017 


rllolUo 


HS.10r3oo 


CQTe 
CO IS 




129057 


VCOACC 

Xd24oo 


H&276770 


uuvvox anugen (wwr a i n-i anuyen/ 


107 
IV./ 


129075 


AA1 29455 




bo IS 


AT 


129099 


1 4*ncn 
LIZOOU 


4 no coo 

ns.10ooZ3 


DifDinDDsponQin z 


A A 


129124 


a aooj con 

AAZ34530 


HS.IOOOUZ 


U --HilAul iTj JiilLLl ^ j li Li Ifliio. fjrilrLi 

N-4^yDnuw8nioe*sen5Sive taoor 


on f 


129160 


AA1312S2 


HS.1Q90U7 


coTS 


0.9 


129164 


A AOOO40O 

AA2oZ163 


u» 4 no Ail c 
HS.109045 


bo IS 


^ ft 


129180 


R40556 


1 1— O40JM 

HSw31o4u1 


cSfs; HgWy sfrn3ar to HSPC039 protein 


7fi 
f .0 


129224 


X89109 


HS.1096U6 


coronin; acfiivbindng pfotesv 1A 


40 
1Z 


129229 


A A444M4 

AA211941 


HS.109DU 


pcdyadsnylate Undlng protein-intBiactiiiQ 


7 Q 


129240 


W2436U 


1_L_ M7AM 


tnterisukn 7 receptor 


0.9 


129241 


AA435665 


HS.1097DO 


PfTn ilnrlnntnli j linll'n Iil Lllj1 It J anMt 

coTsc Moderate sutphtp hni [Mjnus 


0.4 


129243 


H88033 


Hs.109727 


KiAAU/jj protest 


711 
/jO 


129247 


AA151574 


Hs. 109733 


pO&i-Soo trmscr^Oon fiactor 


6.4 


129259 


AAD90695 


n3,1oI3Bo 


ESTs 


DJc 


129270 


Z35227 


HS.10991O 


ras hofiiolog gene fejnQy; menter H 


C A 
0.4 


129281 


AA02B318 


n&2o91Q1 


glucose reguBtBO proieiy ooku 


A A 


1293ft) 


C20976 


Hs.1 10155 


ESTs; ntgruy sunuax to noosornal protein 


0.7 


129318 


N33130 


Lb* oaoanc 


,1.1 . *Ai ilui 4] AJiMiliAiiifaMi Unn»A> <4aH4 

caBnoouun i (pnospnoiyiase nnase^ oena 


■J7 
IJ 


129319 


AA037467 


IL, 4MJA 

KSLO0340 


ESTs 


0 


129351 


A A4ffTOCfl 

AA16/2DO 


HS.0Z349 


rluntan ras Hnuosor ntviA, s ena 


a*. 


129366 


LJ40A07 

H1ou27 


KS.184097 


pjexin ui 




129383 


W92984 


H3.ZooZf4 


CCTn 

boTS 


0.9 


40MOD 

129900 


AA4C4C04 


n&l 10904 


tolS 


A 1 

H.l 


129391 


100914 


LU 444A4 

ns.mui 


uses, aUQB \uiusoprua/ tKHiAMAj j (neui 


mo 


129404 


A A4"WIC£ 

AA172Q3D 


H&3175&4 


CCTb 

boTS 


0u3 


4 OO A Oft 


N23707 


nS. 11 IlOO 


lflAAfI749 wpho rwwtiirf 
IwvWi \£> yciw piuuuu 


•f 


40Q40C 

IZSrfao) 


AAJ49A07 


FlSL 1 1 IJk67 


POT" Htnhhr shidtar tn nmtprn tnhfhilnr n 


3 


129453 


A Ait 04 444 

AAA21213 


1 1- 444000 

nS.11 lose 


LSuU piDBBIl 


o^> 


40OC4Q 
1/SfOla 


UwiP 




PQTe* WnokW cMbir tn frto.'W^ IIKIq 

EOlS» TTOKHUJ btllUtU IU KOOKWt^l |rL3 




400C4Q 

129319 


AAOafiTOC 




ESTs 


o.o 


19Qfiflfi 

ItWWl 


R21443 


Mai ivu^rt 


hMftfihodcfiQlcD nmfetn 1* aloha 


5 


1^622 


AA278243 


HU323949 


ESTs 


6^ 


129626 


AA447410 


Hs.111334 


ESTs; WeaMy simBar to 101 ALU SUBFA 


5.1 


129627 


AA258308 


Hs.71968 


Homo sapiens mRNA; cDNA DKFZp564 


53 


129628 


1)26727 


Hs.1174 


cycBn-dependent Hnase lnh3)ftof 2A (mel 


6\2 


129642 


R50008 


Hs.11806 


7^tehydrocho!esleiDl reductase 


43 


129663 


AA442768 


Hs.11866 


banslocase of forer m&Dchoixiiia) membr 


4.4 


129665 


M88458 


Hsi118778 


KDS.p.ys^sp-Gto4^)efio^bsmtore 


4 


129691 


XD6700 


Hs.1 19571 


coOagen; type 01; alpha 1 (EMers-Danlos 


6 


129783 


AA454618 


Hs.12479 


ast^xidlad nrotecute wBt ttte SH3 doynah 


6.4 


129800 


AA252436 


Hs.12540 


lysophosphoBpflSft 1 


7J 


129838 


AA452161 


Hs^06521 


YME1(SxefevEslae>«ke1 


5 


129850 


N20593 


Hs.288932 


GDPdissociaQoninhB)Bor2 


6\9 


129869 


AA1Q2520 


Hs.13015 


ESTs; Weakly s&nSar to heat shock prate 


5 


129896 


AAD43Q21 


Hs.13225 


UDP-GatbdaGfcNAc bete 1;4- galactosy 


6\6 


129982 


M87789 




trnniunoglobufln gafnma 3 (Got marker) 


4 


129985 


AA450045 


Hs.140452 


cargo sefecfion protein (mannase 6 phosp 


53 



230 



WO 02/059377 



PCT/US02/02242 



130029 AA236412 H&236510 ESTs;MocteratBlyslm8artoPFT27(Mm 5.6 

130033 M90696 Hs.181301 caihepsinS 5.4 

130036 AA195260 Hs.125849 ESTs; Moderately sMr to 101 ALU SU 7.4 

130089 AA056896 Ks.146428 cofiagen; type Vi alpha 1 7.6 
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130170 AA610070 Hs.151469 caksurn/catoxxlul^^ 73 

130189 D43947 Hs.151761 K1AA0100 gene product 6.4 
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130314 D86967 Hs.154332 WAA0212 gene product 10 
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130367 Z38501 Hs3768 ESTs; WeaHy sfrrribr to HO ALU SUBFA 63 

130378 T47333 Hs.155188 TATA box bhdfog protein (lBP)-assoda 7.1 
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AA908797 Hs.180799 



AI815990 
AA488528 
T07216 



305913 
305950 
306004 
306008 
306060 



306505 
306576 
307117 
307138 
307167 
307542 
307554 
307806 
308079 
308307 
308511 
308615 
308677 



H&293515 

H&301226 

AA397546 Hs.119151 

AI953377 t&28444 

AW299459 Hs.1 11977 

AA436942 Ha288529 

AW5Q2498 Hs.15220 

A1424014 Hs.18995 

C75094 Hs.199839 

AI337304 Hs.126268 

AW475081 Hs.172328 
AA421948 
AA456426 
AA5057Q2 
AA507875 
AA533185 

AA630582 Hs.169476 

AA6S3159 Hs.179661 

AA725116 Hs.78465 
AA738110 
AA872838 
AA876109 
AA884479 

AAB89992 Hsl2186 

AAB94560 H&283370 

AA906161 H3J6277 

AA970548 H&297681 

AA987722 Hs.172928 

AA995761 Hs776Q92 



AI184111 
AI185516 
AI190870 



AB81603 
A1351739 
A1472733 
AI5B1398 
AI687580 
AI738593 
AI761173 



Hs.76067 

Hs.172928 

H&276417 

Hs.62954 

Hs.172928 

H&276726 

Hsl27Q208 

Hs.172928 

Hs.169476 

Hs.101774 



EST ctester (not in UniGene) w& exon h 
zinc finger ptoteb 161 
EST duster (not in UniGene) wriOi exon h 
Homo sapiens mRMA; cDNA DKFZp564 
EST duster (not In UntGene)wffli exon h 
WAAQ924 proton 

Homo sapiens mRNA; cDKA DKFZp434 
rnainx inetaitopiotetoase24 (mernbrane-m 
EST duster (not tn UniGene) wffli exon h 
EST duster (not to UniGene) wift exon h 
rnuffipteUriiGere matches 
cherootdne (C motif) XC receptor 1 
ESTs; Wealdy stmBarto C11D2.4 [Ceteg 
ESTs 

ESTs; Weaidy stotiar to putative [H.sapte 
EST duster (not to UniGene) wito exon h 
EST duster (not in UniGene) w»i exon h 



EST duster (not in UniGene) wflh exon h 

EST duster (not in UniGene) wi&i exon h 

paDki (mouse) homotog; paffidin 

EST duster (not in UniGene) wffli exon h 

acQn43oe6 

ESTs 

mSogen-acSvated protein kinase 9 

ESTs 

ESTs 

EST duster (notte UniGene) wtto exon h 
EST duster (not in UniGene) wffli exon h 
ESTs 

ESTs; Weakly sinter to predicted using G 
EST duster (notte UniGene) w9i exon h 
ESTs 

ESTs; WeaMysimtar to zinc finger prute 

ESTs; Moderately similar to KlAA0456p 

ESTs; Highly sknferto NG22 pisaptens 

ESTs; WeaWy sfmfer to slmtor to PDZ d 

coflagen; type I; alpha 1 

EST singleton (not In UniGene) wBh exon 

EST 

EST stegleton (notto UniGene) wifl> exon 
EST singleton (not In UniGene) with exon 
EST singleton (notto UniGene) wffli exon 
QjycefaktehyoV3-phouj4ute dthydiogena 
ESTstogfefcn (not in UniGene)wHn exon 
EST stegleton (not to UniGene) wxn exon 
EST stogteton (not in UnEene)wifo exon 
keraftoO 

EST singleton (not to IWGene) w3h exon 
EST stegleton (not to UniGene) wift exon 
EST stegleton (not to UniGene) wflh exon 
EST stegleton (not to UniGene) wfth exon 
EST stegleton (notto UniGene) wfflt exon 
EST singleton (not to UniGene) wifo exon 
EST stegleton (not in UniGene) wfih exon 
EST stogjeton (not to UniGene) wBh exon 
heat shock 27hD proteto 1 
collagen; type I; alpha 1 
EST singleton (not to UniGene) wfo exon 
EST stegteton (not to UniGene) wito exon 
EST stegleton (not to UniGene) wfth exon 
EST stegleton (not to IWGene) wtte exon 
ESTs 

coflagen; type I; alpha 1 
EST stogjeton (not to UniGene) wflh exon 
EST stegleton (notto UniGene) wffli exon 
EST stegteton (not to UniGene) wfth exon 
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72 

53 
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63 

12 

4.2 

8.4 

52 

28.4 
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6.1 

73 

63 

5.4 

9.8 

73 

7 

1Z4 

8J 

53 

4.1 

7.7 

63 

53 

m 

4.4 

43 

73 

19.7 

53 
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83 

4.1 

6 
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4.7 

42 

5.4 

10.1 
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308852 A1829848 Hs.182937 pepfidytyHolyt tsomerase A (cydophfin A 

308974 A1872290 Hs300697 lmnumogk)b^gamma3(GmrTm^ 

AJ873242 EST shgbtan (not tnUnfGene) with exon 

AJ880172 ESTsl^l6tDn(ndb)UruG8ne)wahexon 

5 309177 AI951118 ESTskigfelDn(notinUnRBene)wffiiexon 

309186 A19S2723 Hs30207 ESTs^b!Dn(notInUnX^)w»iexon 

308198 AI965915 n^hbtooOTijafib^romifex; class!; 

309226 A1969897 EST singleton (not in UnlGene) wffli exon 

309279 AI990102 ESTsJngfeton(notlnUnK5ene)wfihexon 

10 309583 AW170035. EST 

309824 AW191929 Hs252989 EST 

309629 AWf 92764 Hs.172928 oofegen; type fc aftrfia f 

309841 AW1 94230 H&253100 EST 

309898 AW238461 Hs.73742 ribosomal poteh; large; P0 

IS 309700 AW241170 Hs.179661 Homo sapiens done 24703 beto^ubuBnm 

310073 AB35Q04 Hs.148558 ESTs 

310094 AW450967 Hs235240 ESTs 

310373 AW080778 Hs.145582 ESTs 

310438 AW022192 H&200197 ESTs 

20 310470 AI281848 Hs.194691 ESTs 

310583 AW205632 Hs211198 ESTs 

310877 T47784 Hs.188955 ESTs 

311067 AJ587332 Hs209115 ESTs 

311166 AI821294 Hs.118599 ESTs 

25 311199 T57896 Hs.191095 EST duster (not to UniGene) 

311465 AI756660 Hs20S132 ESTs 

311587 AI828254 Hs271019 ESTs 

311774 AA700870 Hs.14304 ESTs 

311785 AI056769 Hs.133512 ESTs 

30 311923 T60843 H&169679 ESTs 

311935 AA216387 EST duster (not in UniGene) 

311972 N51511 Hs.188449 ESTs 

312014 AJ435650 Hs.123778 ESTs 

312047 AA588Z75 Hs.180669 ESTs 

35 312147 T89855 Hs.195648 EST duster (not In UniGene) 

312153 AA759250 Hs.153028 cytoduDfl»t>661 

312168 T92251 Hs.1988B2 ESTs 

312172 A1222168 Hs. 191 168 ESTs 

312226 AI796815 Hs.199993 ESTs; WeaWy slmSar to uNqutous TPR 

40 312292 AW451893 Ks.151124 ESTs 

312312 A10805Q5 Hs.134529 ESTs 

312369 AA5S2039 Ks.173884 Homo saptensmRNA; chromosome Ispe 

312407 R48180 Hs.153485 ESTs 

312430 AW139117 Hs.117494 ESTs 

45 312470 AW451347 Hs.175862 ESTs 

312483 AM17526 Hs.7753 ESTs 

312521 AA033609 Hs319093 ESTs 

312544 AI498371 H&183526 ESTs 

312638 AW439195 Hs256880 ESTs 

50 312754 R99834 Hs250383 ESTs 

312772 H63791 EST duster (not in UniGene) 

312821 AA699325 Hs269880 ESTs 

312837 AW292286 Hs255G58 ESTs 

312849 AA846353 Hs.194054 ESTs 

55 312854 AA828713 Hs321Q58 EST duster (not to UniGene) 

312992 AA088446 Hs.170298 ESTs 

313096 AM22367 Hs.163533 ESTs 

313112 AA732S34 Hs269099 ESTs 

313126 AA720887 Hs283313 EST duster (not in UniGene) 

60 313136 N5S284 Hs288010 ESTs 

313197 AI738851 Hs222487 ESTs 

313219 N74924 Hs.182099 ESTs 

313258 AW068358 Hs.183918 ESTs 

313328 AW449211 Hs.105445 ESTs 

65 313352 AW2921Z7 Hs.144758 ESTs 

313417 AA741151 Hs.137323 ESTs 

313455 AW081702 Hs38571 ESTs 
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313590 


ft a on t A d A ■ I— Mi g fw 

AA804410 H&291677 


EST duster (not tn UniGene) 


C 

Ow3 


313663 


A1953261 Hs.169813 


ESTs 


fJt> 


313667 


t IAMAJ ft | A AffrA M 

U69201 Hs.13684 


_ til - ft ft . _ » W _ i_ ■ > It ^ ^ It--- * 

ESTs; Weakly similar to choline kinase b 


41 C 


313749 


AW450376 Hs.119004 


ESTs 


c e 
OO 


313632 


AW271022 Hs.133294 


■ ESTs 


43 


313881 


AA535580 Hs.16331 


ESTs 


7.7 


313915 


AJ969390 Hs.163443 


ESTs 


ryj a 
llA 


313965 


AI858884 H&270647 


ESTs 


5J 


313974 


AB10151 Hs.173524 


ESTs 


43 


314097 


AA648744.H&269493 


ESTs 


145 


314129 


AA228366 Hs.115122 


ESTs 


95 


314359 


AA205569 Hs.194193 


ESTs 


5.4 


314384 


AA535840 Hs.152203 


ESTs; Weakly sonBar to aitemafively spB 


53 


314394 


AI380563 Hs.130816 


ESTs 


133 


314462 


AA347951 K&326413 


ESTs 


6\2 


314465 


AA6Q2917 Hs.156974 


ESTs 


18.1 


314470 


AI934422 Hs30661 


ESTs 


4.2 


314488 


AA358265 Hs.182690 


ESTs 


6.1 


314506 


AA833655 hK206868 


ESTs 


273 


314510 


AI204418 Ks.190080 


ESTs 


95 


314558 


AB73274 Hs.190721 


ESTs 


225 


314661 


AM36432 Hs324239 


EST duster (not in UniGene) 


133 


314691 


AW207206 Hs.136319 


ESTs 


21.4 


314754 


AWD2G761 Hs.134374 


ESTs 


4.4 


314775 


AI149880 Hs.188809 


ESTs 


4.4 


314943 


AW76797 tte.184572 


ceB rfMskjn cycle 2; G1 to S and G2 to M 


18.4 


314961 


AW008061 HS231994 


ESTs 


10.2 


314963 


AI689617 H&200934 


ESTs 


53 


315006 


A1538613 Hs^98241 


ESTs 


20.7 


315010 


AA531082 H&240049 


ESTs 


5 


315019 


AA532807 Hs.105822 


ESTs 


6.1 


315033 


A1493046 Hs.146133 


ESTs 


12 


315036 


AA534953 Hs.163297 


ESTs 


83 


315037 


AW205863 Hs.133988 


ESTs; Weakly stater to gene MAC25 pr 


6.1 


315051 


AW292425 Hs.163484 


EST 


127 


315054 


AI968598 Hs.78768 


ESTs 


75 


319)73 


AW452948 Hs257631 


ESTs 


133 


315080 


AA744550 Hs.136345 


ESTs 


4.4 


315083 


A1221325 Hs3Q5442 


ESTs 


5.1 


315088 


AA557351 Hs.152448 


ESTs; Moderately slmlar to MULTtFUN 


4.7 


315175 


AHJ25842 Hs.152530 


ESTs 


113 


3151% 


AA972756 Hs.44898 


ESTs 


283 


315296 


AA876905 Hs.125286 


ESTs 


16.1 


315303 


AW194364 Hs.128022 


ESTs; Weakly stmQarto F1G-1 PROTEIN 


25.7 


315352 


AA604799 Hs.136528 


ESTs; Moderate!/ stmfer to 00 ALU SU 


123 


315364 


AA6436Q2 Hs.155485 


ESTs; Highly sbnSar to serine protease [H 


45 


315368 


AW291563 Hs.104696 


ESTs 


43 


315390 


AI801565 H&200113 


ESTs; WeaWy stater to attemafiveiy spS 


4.4 


315408 


AW273261 H&216292 


ESTs 


5 


315458 


AA872000 Hs.116104 


ESTs 


75 


315472 


AA828850 Hs.165469 


ESTs 


43 


315478 


AA665612 Hs.120874 


ESTs 


5.2 


315498 


AA62B539 Hs.116252 


ESTs; Moderately stmflar to Ml ALU SU 


43 


315527 


AI791138 Hs.116768 


ESTs 


4.4 


315530 


AE00852 KS.1277B0 


ESTs 


22.4 


315562 


AA737415 Hs.152826 


ESTs 


53 


315634 


AAB37085 Ha22D585 


ESTs 


83 


315647 


AA648983 H&212911 


ESTs 


15 


315652 


AI521489 Hs3053 


ESTs 


63 


315676 


AW0Q2565 Hs.124660 


ESTs 


95 


315660 


AA814309 Hs.123583 


ESTs 


8.1 


315735 


A1831760 Hs.155111 


ESTs 


134 


315741 


AA812168 Hs.122559 


ESTs 


5.4 


315769 


AA744875 Hs.189413 


ESTs 


4.4 


315978 


AA830893 Hs.119769 


ESTs 


10.4 


315984 


AI015862 Hs.131793 


ESTs 


5 


316042 


AW297979 Hs.170698 


ESTs 


14.7 
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316136 AA83080B Hs.124366 


ESTs 


4 


316177 A1S08272 


Ha^931Q2 


EST duster (not in UniGene) 


32.6 


316313 AA741300 H&202599 


ESTs 


4.8 


316405 AA757900 Hsw270823 


ESTs 


4.8 


316460 AI749921 


HSJ205377 


ESTs 


12S 


316564 AI743571 


Ha.168799 


ESTs; Weakly similar to Hit ALU SUBFA 


8.1 


316714 AA809792 Hs.123307 


ESTs 


5 


316715 AM4Q268 


Hs.170673 


ESTs 


A2 


316828 AA828116 Hs.173076 


ESTs 


52 


316869 AI954880 


. Hs.134604 


ESTs 


133 


316905 AW138241 K&210846 


ESTs 


6.2 


316943 AWD14875 Hs.137007 


ESTs 


53 


316949 AA856749 Hs.124620 


ESTs 


72 


317008 AW051597 Hs.143707 


ESTs 


4.1 


317028 AA962623 Hs.189144 


ESTs: Wealdv similar to RENAL SODHI 

tO 1 O, If COM J OS 1 UnZI HlnblWLuUUlU 


A2 


317067 A1805392 


Hs325335 


ESTs 
tola 




317069 AI732892 


Hs.190489 


COlo 




317210 AA490718 




peT rkictnr /nil In 1 Inifennl 


A A 


317298 AI922374 


Hs.158549 


cot a 




317658 AW139077 H&202217 


CO IS 


A ft 


317674 AW294909 Hs.132208 


cere 




317685 AI798630 


Hs.149997 


Colo 




317836 AA983913 


Hs.128929 


ESTs 


12.4 


317881 AI827248 


Hs-224398 


co la 


12.1 


317902 At8286Q2 


Hs311265 


CCTc 
Colo 


ft A 
o.o 


317916 AI565071 


Hs.159983 


CCTe 
COI8 


10 fi 


318042 AW294522 Hs.149991 


ESTs 


5.6 


318053 A1074465 


Hs.133469 


ESTs 

COlO 


4 


318064 AW296888 Hs.170939 


Cola 




318070 AI024594 


Hs248942 


ESTs 


4.7 


318073 AW167087 Ha.131562 


Colo 


117 


318146 AI040125 


Hs.150521 


ESTs 


5.9 


316186 AW016773 Hs3709 


ESTs 


53 


318481 A1291584 


Hs.145921 


Colo, TVcalUj dUnUal 10 nirVslrlCllUn 


7fi 


318566 AB35361 


HSL226376 


Colo 




318617 AW247252 Hs.75514 


fHiriposHa nhnsnhorvbtcA 


11.1 


318662 AQ85898 


Hs^94014 


ESTs 


163 


318691 AW192139 Hs.181307 


no iijou*io» toituij gn 


4 


318740 NMJ002543HSL77729 


EST rfttcfar/rwrf In UntGaie) 


213 


318744 AI793124 


Hs.144479 


ESTs 


35 


318948 AA317274 Hs.13996 


ESTs 


11.7 


319163 F15257 


Hs2T 






319478 R06841 


H&270307 


EST diBtfif I ml to Lfo]Ga») 


3^ 


319545 R83716 


Hs.14355 


ESTs 




319668 NM_002731Hs37773 


per rfcrator fnn* In t lntftpnpt 




319763 AA460775 


Hs.6295 


ESTs 


7 


319913 AA179304 


Hs2715B6 


EST«* MrvleralaV simitar to Ml ALU SU 

tO 1 Op mUUClOIIXIJ OJIIillOJ IU Ul! nLU Ow 


8.7 


319936 W22152 


Hs?ft?9?9 


EST difiter f not to UnKaenel 

■ %J 9 %mMOVG* ^ifv% ftl Ul Uvviiw^ 


5.6 


319951 AA307665 


Hs.14559 


ESTs 


4.9 


319962 H06350 


Hs.135056 


ESTs 


9J2 


319977 AA632632 




EST duster (not in UniGene) 


4.6 


320074 AA321166 


Hs278233 


EST duster (not In UnlGene) 


16.7 


320092 AFQ22799 


Hs.113292 


cafoafci9(nCM) 


5.4 


32010? AA836461 


H&291712 


EST duster (not In UnlGene) 


53 


320133 D63Z71 




EST duster (not In UniGene) 


5^ 


320167 AA984373 


Hs30790 


EST chister (not to UniGene) 


15 


320187 T99949 


Hs303428 


EST duster (not to UniGene) 


6.7 


320211 AL039402 


Hs.125783 


DEK£-6proies) 


243 


32)401 U90449 


Hs.152717 


rujdeostiecfiphosj^ktoasetype6(toh 


10 


320456 A1884396 


HS24131 


ESTs 


5.4 


320488 R31386 


Hs,1917Si 


EST chister (not to UniGene) 


4.9 


320521 N31464 


Hs24743 


ESTs 


9.5 


320661 AA864846 


Hs.115175 


EST duster (not to UnlGene) 


6.6 


320891 R61576 


Hs313951 


hypothetical proteto 


5.9 


320699 R63161 


Ks.118249 


EST duster (not to UniGene) 


4 


320727 U96044 


Hs.181125 


EST duster (not to UnlGene) 


153 


320993 ALD50145 


H&225986 


Homo sapiens mRNA; cDNA DKFZp586 


72 
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321012 AA737314 H&194324 


PITT chister tndt hi Unk^nfi^ 


6\1 




POT rfcictor tnri bi I )n flan/A 
CO 1 aU5an \UUl at UiUUtNIc/ 


5 


321051 AF134149 Wc24fmS 


per chstor (tuA bi UnfGaiei 


11.4 


321171 At7fi9d1Q H<t221461 


Cola 


7.7 


32113? AA2S53Q4 Hq9Q7Q3Q 


. pcT«a> UffMldv ehmfcir in nemefdn fMjan 
COl«S ntniiyj ouiuioi vj i raajc* in i|rLoa^r 


SJS 




per (4iK|nr /not b Unk^enel 


16£ 


004007 uyMM4 Uq 44497ft 
Oil dor nDOuHt nS.141£#0 


co is; w daily auiiuta si un ALU ouotm 


42 

*t»4. 


WLA19 AuncROAC U c TOftCH 
3£r*\£ AYfODDOUO nSX£09l 


pert Hitdor fnnt In 1 Intftfwwi) 


63 




p<rr<r Mryfeniteltf shnflar Id lift ALU SLI 


g 




A DDO J1» Lwalatpjl tmnlnln ^ utk^nA Iiimh 

tvir£ (aoiiHcwieo pmran yessiy nimi 


1 i«o 




CO IS 


104 


994ftf!C iftOJWOfl Us 770U07 


CO f uuoun ^irh in uruotaio/ 


193 


0x1031 nrf 1 Of 'it** TO, IDOiW** 


COIS 


5.6 


vMcwrt ucmcc Lio. <rv4mn 
rro/uoo HSJcnWU 


CCTcf UlAalrttf c^mllor \n till At 1 1 CI IQPA 

col s» wrawj sunuar ro uu alu ouota 




AnrUOOZDO n3L/SfiflWJ 


PCTir WpakW dhdbr h> Nil AI 11 HI ASS 
CO ICS rrcoKlf SuHBkm lit iiti r\LU IH/iOO 


63 


3ZI9/0 N/734Z HSwZlooT 


COT j4rieior /nnl tn 1 IntRjMtot 

col GusiBr \noi in unioenB) 


10 9 
tux 


MM47 U_ (WftO 

oZflil/ AA3hXJo9 H&919Z 


ESTs 




Scaiat aa/joo// nSucoooro 


In*** ntamnllu UnnnmlnJii m/VMitnr f f^mtHal 

sjw uBnSuf upopfDicxn icospuji (lanniKD 


£.1 JO 


3ZOJ35 AL13751/ n3J0o2Ul 


CCT jAintn. /——A Vk t (nKnnn\ 

coi aussr (not n unjoene) 




322171 APD859S9 H&48474 


CCT jJluiIj - #m>A Lt 1 )«V> Mn \ 

coT duster (not in uniGeoe; 


*/ 


322175 AF085975 


col QUSiBr (not tn unio&ie; 


77 


322236 AL134970 Hs.104222 






3S303 W07459 HS.1 57601 


hoF cnstBr(noi in unxiene; 




322735 AA0B6123 H&297856 


C*5T jJ.tj.4rL. /«nt U. 1 

EST duster (not in untGene) 


7ft 


322777 AA679082 H&26&47 


ESTs 


4.4 


322818 AVVD43782 H&293616 


ESTs 


14 

21 


322882 AW248508 H&279727 


DiGeorge syndrome crificaJ region gene 2 


4cq 

loo 


322975 C16391 


EST cluster (not In UniGene) 


04 1 

213 


322991 01892 Hs.1 59473 


ESTs 


44 T 


323u11 AA58Q288 


EST duster (not in UniGene) 




323091 AWD14094 Hs21u761 


ESTs 


4rt ft 


323107 AJ3011Q7 Hs.150790 


ESTs 


ft i; 


32313d ALi2Dj5i r&30177 


EST dustBT (not in urauene; 




323168 AL120862 Hs.124165 


ESTs 


17.9 


3Z3195 Aluo438Z HS.1 17950 


ifTTfm i JT i ll mAjhwIU, — ii wtiUrr In OA 


C ft 


3Z32D1 AUM937D HS.13350 


Homo sapiens dtkna, clwv UKriCpooo 


14 ft 


mMam A A0AO41C LI* 44A4DC 

3Z32D3 AAZQ3135 HS.1 301 W 


coTS 


ft/1 


323243 W4437Z HS. 110/71 


cot duster (not tn unftene; 




323244 T70731 Hs.1 93620 


EST cbster (not m UroGene) 


4ft ft 


3Z33ZB AAZZoUro HS259096 


r-CT /nnl tn 1 UV2ann\ 

coT dustBT (not in unibene) 


Aft 


3Z333Z AI8Z95Z0 n3JtZ/0l3 


colS 






co r auscBT (not n uniuenei 


Aft 
0.0 


3235/0 Aumodzj rtS^Uo/52 


CCrr«> Uln^L4» m&mJbur In 1111 AI 1 1 OJ IDCA 

tSTsi weaxiy stnwar to uu alu sudta 


3 


WacM AI7C44O0 Lin A4'Ty4 

323604 Al/51400 H3.41Z71 


CCTo* LA/iV^trLt oWrtA^f fn lilt AI 1 1 CI tDCA 

coTs; vveawy snnuar id iui alu suw-a 


ft ft 


323ooo AAo442D5 rQu»90oo 


coi G«niBr(noi hi uiuuGnej 


7 1 
/.l 


hh&nArn Uo 7mee 
oZSto* mau1\j< nSL/Ufbo 


CCT 4*hidei> /n>M m 1 lntfSrHw\ 

col uU5Im (nn hi unwene/ 


R1 

D. 1 


02381/ AA41U94o 


col c&jsiej (nui in urauene/ 


1ft ft 


OZ3B4D ARxWo/4 rfSL4li4U 


COTo* tAfmlMb flfcuiVn A— iuuImii fn fSTIAtvn 

co i s, vvranj SBnoBa w wacBw jneEin 


101 

1U.1 


MMM AAC7nftBn U«. Q-fT% 
ACJiWtf rww/UOSM) nSdOl/J 


pCTe 
COI 5 


64 


009007 AAQ4AOTT7 Lin T7A>lc4 

3/Sfijf AA5445AJ/ 119^274454 


coi Custer (not n unioens,) 


g 
0 


aZ4v*r nAJ/aZUi KSwZ/1340 


co f custer (not 01 uniuene/ 


ft 1 




PCT i^ieia. /nfU in |)nir^«K»\ 

co i wuinpi yiim in unn^jtR?/ 




324302 AA543008 HS292471 


ESTs; Weakly sfanBar to uH ALU SUBFA 


5.7 


324338 AL138357 Hs.145078 


ESTs 


9^ 


324344 AW5K000 Hs.46677 


EST duster (not In UniGene) 


4.4 


324432 AA464510 Hs.152812 


ESTctoer(nc^uiUnR3ene) 


167 


324495 AW501411 Hs.122489 


ESTs; WeaWy strnflar to 1111 ALU CLASS 


53 


324497 AW152624 Hs.136340 


ESTs 


5.4 


324598 AA5Q2659 H&163986 


ESTs 


&8 


324603 AW016378 Hs292S34 


ESTs 


23.1 


324620 AA448Q21 H&94109 


EST duster (ixrt In UniGene) 


21.2 


324727 AW10425 Hs,19597 


ESTs 


5 


324774 A1G31771 Hs.132586 


ESTs 


5 


324783 AA640770 Hs.200994 
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TABLE 13A 

Table 13 A shows the accession numbers for those pkeys lacking unigenelD's for Table 13. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Ptey: Unique Eos probeset WeirtHier number 

CAT number Gene duster number 
Accession: Genbank accession numbers 

Ptey CAT number Accession 



123619 371681 1 AA6G2964 AA609200 
20 103207 30835.-4 X727S0 
103349 11052__-2 XS9059 

110856 19346.14 AA992380 N33063 N21418 H79958 R2191 1 H79957 
113248 328626.1 T63857 AW971220AA493469T63699 

123169 44573.2 AB50087 N70208 R97040 N36809 AI3081 19 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671 
25 AA908598AA251875AI820501 AI820532W87891T85904 U71456T82391 BE328571 T75102 R34725 AA884922 BE328517 

AI219788 AA884444 N92578 F13493 AA927794 AB60251 AW874068 AL134043AWQ353WAA663345AW008282AA488964 
AA283144A!890387AI95Q344AI741346AJ689Q62AA282915^ 

AI762688 AA988777 AA488892 A1356394 AW103813 AB39642 AA6427B9 AA855975 AW505512 AB6153Q AW629970 

BB612881 AWZ76997AW513601 AW512843 AA044209 AW856538 AA180009 AA337499 AW961101 AA251669AA251074 
30 AI819225 AW205862 AI683338 AB58509 AWZ76905 AB3300S AA972584 AA908741 AW072629 AW513996 AA2S3273 

AA969759 N75628 M22388 K84729 H60052 T92487 AK22058 AA780419 AA551005 WB0701 AW613456 AI373032 AI564269 

R»531H83488V^181VV78802R56056AI002839R67840AA 
123533 aenban)LAA60a751 AA608751 
116480 genbanLC14088 C14088 
35 132225 genbanK_AA128980 AA128980 
125154 genbanLW38419 W38419 
118475 genbanKJi66845 NG6845 

102919 25180.2 M21 191 AL035748 AA021266 AA323126 AA180515 AB13029 D28356 NrvL00Q034 Ml 1560 AW401425 AW248248 AA012956 
AA323294 W04965 H38759 AA206622 AA580747 AI541172 AA381075 AA354229 AVW02353 AW4Q5575 AW404021 AW406207 
40 AA075752 AW176066 AA287222 AA19581 8 720243 RS7945 AA295539 AA4Q2533 AA232419 AA224515 AW401583 AA331367 

AW402140AW249079R31488AA075757X05236AV^ 

AA216544 AWZ75288 AA227044 AL038124 AA243300 C03242 AA315615 AL035840 R64336 AA313917 AA018963 AA001385 
AA054395 H30840 AW498825 AA086141 AB57324 AA121576 K39128 R77161 AA019688 AA380987 AA348140 AA348257 
AW176086 AA362432 AA171389 AA362416 AA299938 AA319093 AA337972 C04921 AA345696 R8954G AA085425 AA481708 

45 AA313637 ALQ39229 H84490 K86153 HW656 AA326668 AA347304 R65890 H41949 AA339309 AW4020Q2 AW404854 

AA1S2582 AA1128Q2 H09248 N83165 H38367 AA3S6339 AA455763 R66853 AA294935 H85911 AA310414 H93436 N87014 
AA001 186 H83640 AA41 1328 AA317929 C04192 AW406288 U46335 AA323179 AA427649 A1366131 H14328 AA197161 
AA379497 AA31 1816 AA017206 AA001137 AA017420 AA012990 AW163775 AA021397 AA295513 AA355246 AA374921 
AW380419 AA345864 AA318058 AA371711 AA363255 AA057094 R88057 AA394045 AW36274 1 AA479579 A W362789 

50 AW362775 AA223624 AW362809 AW362746 AW382753 AW380412 AA057104 AA295Z71 XD6352 AW36Z771 X12447 R59553 

R87325 AA292864 R83920 F01 120 AA496740 AA852551 AL039209 AA093339 AA333961 N86416 C03325 C03831 AA194750 
AA359270 AA359460 AW245492 AA339400 AA330784 AI908970 N89223 AW361938 AI940063 AA1 34323 AA371741 F01267 
AW372970 AA341973 AA346098 AW372969 AA337549 AA327342 H93855 AW74079F29118AA852940F35696AA345963 
AA079578 AA113785 C02989 AA095945 C03145 C05199 AA346Q24 AA190506 AW361659 AI909845 AW374374 AW374382 

55 AW374401 AW374373 AW374370 AB09831 AW374367 AA353858 F01041 C02843AA375948AW374414AA213946AJ525039 

H13744 R31Q07 AA112044 AA134404 H47935 AW177018 AA429768 AA336873 AA112B75 H46393 AA191267 D59131 
AW406Q37 AA055244 AA341880 AA179024 AA308537 AW406985 AW32731 1 H30301 AA300705 H43788 AW364149 
AA806213 AA481936 C04941 AW375299 M21190 AA410818 AA250940 AC54547 AA317422 AA250903 AI865497 AA890603 
AA366197 AW498538 M78072 AW406461 C03092 FO0308 H56488 AA336320 AW406501 AA354102 AA382942 AA098393 

60 AW376830AA383446F01259AVV081388T94036AA379643H43M2AI5240^ 

H44129 N84794 F01 135 AA477852 AA293062 AW361595 H27194 A1831650 H43253 H24797 AB64680 AA380090 W20057 
AB21586 AJ192549 AW090808 H25967 AA918121 AI626060 H20221 AA812572 H42178 AA887222 H96000 C03180 F00946 
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C03986 AI318091 A1860172 AA582179 A1633388 AA557193 R68075 F24105 AW518239 W56622 AI625219 AI925243 
AW468046 AI921828 AA339164 AI144391 AA643334 AA459631 AA873247 AW373432 AA604384 H27600 AI680458 AA159956 
AA610836 AA384298 AW373435 AA6Q4225 W73754 AW087924 AA599776 N892Z7 AI830871 A1633128 AW514329 AA01O455 
AA563928 AI571596 AI128394 W73707 AM23575 AA5B3809 AA657988 AI950837 AA169782AA600009AI885540AA771BB4 
5 A1978829 AA505408 AA533937 AA481469 AA610869 AA775241 AW273870 AW0709D9 AI905695 AA4801 1 5 AA574051 

AJ8891B5 AA773167 AA331375 AA001437 AA194324 AA1943C0 AA558632 AI038538 AA41 1329 AA781570 AI833176 
AA935520 AW074ig7 AA583063 AW073099 AWC01198 AA948Q25 AA587857 M191540 AM60085 AA193244 A1538Q37 
AA515572 AA758587 A1149311 AA508610 AA206409 AA5340O4 AA994600 AAB27543 AB16349 AW245123 AW517804 D25663 
AA781985 AA2B4536 AI819422 H16040 H27531 AA456564 AA845555 AJ423596 AA012908 AA889439 AA716311 AA968S68 

10 AA320508 AA725731 AA834202 AA935997 AA724815 AA769353 AA594603HJ0827 AI342442 AKJ03519 AKH2503 AB47597 

AI040946 AA197162 AA987883 AA292865 AWD01944 AI64071 1 AW244044 AA456784 F30588 AA290829 H24754 AI97B683 
AA483686 AA583939 M121382 AA633831 AA477102 AA977322 AA666379 F35456 AA993537 AI749610 AA226934 AA7162D4 
AW513Q25 AA628543 AA583705 F25702 AD68748 A1124097 AB80086 AA477513 AI758834 AI690753 AA477746 F37761 
AA642243 AA159957 AA250844 AA459406 AA427566 F25054 AI569314 AA961665 AB22050 AI75900Q AA555236 AA514432 

1 5 AA293474 AA001129 AA82S789 AA641390 AA134405 F35585 AA477416AW193359AJ351315 AA2W988 F36340 AB61322 

F26969 AA991922 AA021267 F26973 AI361314 F35891 A1918509 AA25Q964 AA190992AA577139AA865535AA134324 
AW192842 AC24046 F1B975 AA779626 AA856894 AW269997 AW014614 H95554 F31378 AA374668 F26343 AA654007 
AI830942 M113195 F26432 W56652 AA464690 AA055263 AA340654 AA031448 AA976399AA972526AAD63476R83321 
T16240 AA5332S0 N91545 H44053 AA683451 AA513761 AA0B6477 H09249 F20482 F26737 AA054148 AA857D83 AA017259 

20 AA179789 AA088908 H43704 AA194320 F35950 AI880127 F20441 F3287B AA962483 H39094 H56489 H44621 F19390 

AI302232 F25162 AAB26965 AA086052 AA917410 AA454513 R59554 AA196755 AA086369 AA07953O H28108 AA243301 
AI025737 AA101239 AA088887 AC14910 AA974886 F16089 F26054 AA515092 F33436 F32829 M78061 AA235645 F19715 
F37529 AIB1 1549 AA665160 AA708200 F01 124 F32382 AA346220 AA627361 F30741 F30010 F28543 AA211715 F20245 
AA331222 F25634 F21996W28215 F21911 R65793 AI192566 H20130 H84491 AA719223 AA557435 F16967 F26989 F30353 

25 AAB57159 AA291918 F28234 F20840 F25176 F22437 T27904 AA480355 F19528 R87928 H14286 F27532 AW337864 F2B411 

H13892 F25651 AA975454 F24229 F29G57 F18Q24 AA464779 F17588 F34954 AA947328 F18063 AA657777 AA459644 N91455 
F27850 F29608 F27206 F18418 F31459 F18564 F33496 F16376 F29740 F29843f29904 F29866 F19135 AW276602 T40337 
F24835 F34672 F26474 AI926215 AA464185 F18217 N26193 ALM3256T41197 F33055 F00388 F29500 F34191 F33297 
AI937207 F2Z724 F15909 F26232 F18889 AA318827 F29085 AA872104 F17509 F23373 F15660 F17552 F17412 F16B63 

30 F34033 F21515 F17364 F18383 F16546 F17551 F17260 AA292000 F15723 T47438 F16798 F18048 F18319 F17978 F17566 

F34230 F33258 F20860 F17998 A1695701 
118600 genbanK_N69222 N69222 
118952 genbank_N92966 N92966 
120873 genbankJVA358015 AA358Q15 

35 113702 genbanK_T97307 T97307 

129982 12AJS! Z14221 AW381862 M97920 AW401444 Z66542 M29470 AW406502 X6101 1 M34024 AA327072 Z14166 214167 Z14165 

AW403806Z14200AA383972Z14205Z14201 M18513 Z14202 AW403684 X14584 AF062221 U43760 X65892 X65883 X62107 
Z80847 X65885 XB5893 AF062142 X65891 X17675 Z47274 Z47277 Z47276 X65888 Z47275 XB2109 AF082140 L01278 
AF062134 AF062139 X81723 Z80840 X81733 X81743 X81744 X81732 Z80843 AW402942 AW403516 XB5919 AF062190 

40 AF062177AF082222AI^15Z47240AF062263AR^1AR)62223A^ Z47238 AW401714 AW404008 AW404991 

AF062280 M99595 Z47214 Z47232 Z47218 M26995 AF0621 84 X65895 L38433 X81731 Z1 1946 Z47226 AF062205 AF174012 
L01Z76 AF062168 AF062136 X81755 XB1748 AF174019 AF062285 L06924 L11699 AW402B65 L09070 L28049 L08090 
AW407843 Z14171 AW4Q2944 L08083 AW405627 L33035 L26907 M17750 L28052 M17751 AJ239360 019885 L14821 X56526 
AB014341 L12087 L12098U68231 L12184AR)62242AF{K220(K)U64499U00570AI268G04Y15773 

45 X73605X99360UOQ577D83677AB021539AF035796Z33899U^^ 

D83676 AB021529 Z18318 U00488 101412 XB1746 U21262 U21272 U00560 AF174060 U0Q547 U00561 Z18321 S73957 
M26435AF115130AF115inZ92896 U21254Z32895AF115112AF062290T28^ 

077373 AF174057 M17749X69692AF174Q38 U64478 11644% AF174098 X99365 AF174077 AF174083 AF174089 AF015133 
AF127792 AJ0Q6171 U00510 U58144 AF004323 AF1 15109 AF0Q4324 AF015130 AF090414 AF090418 AB021536 AB021519 
50 Z96957AITO986Z92898Z96956L34164AF062251AR)52524U)Q549U0^ 
AF022011U3085lX)05mi9915L43083AH 
AF062298AF062289AF0S2307AW4O8326L33O37LO4337IJM328X8174 

AF191092 AB0S954 A^1031&4 L04343 AW364860 AF001424 AF103163 298717 AF103143 AJ008250 L04323 AF103321 
AF103335 L04342 LD3818 L03817 AW404978 AW403690 U86801 AF103150 AJ010334 AF035027 AJ007327 AF103115 

55 AFO17458AJ0082Q7AJ00&183AJ008196AJ008241 AJ00a208 AF103210 AF068668 AFOS8670 AF068671 AF068664 

AF068669 AF088666 AF068665 AF068672 AW371 244 AW403670 AW408074 AW404575 AW3621 53 AW403803 AW406702 
AW351514 D78345 T29140 J00231 NM_002179 AW405146 AA301091 XD4646 H64660 AW402990 AW4C6534 T93007 
AW57980 AW36B899 AI905B33 AW406586 AA482084 A1872299 AA71 5266AW404328AI831674Ar709348AA603112 
AW514864 AA485775 K64492 AW404789 AA487630 AA715498AA295885T27613 T98113 

60 115763 genbanKJ\A421560 AA421560 
124357 genbanKJC2401 . H22401 
108733 5O4107J AA121022AA126422 

101544 entoLM31169 M31169 # 

124447 9enban)LN4800O N48000 
65 124677 genbank_RO1073 R01073 
124777 genbanKJW1933 R41933 
119302 genbanKJ25725 T25725 
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103680 entrezLZ93784Z93784 
135029 H58818_at H58818 
112253 genbanlLR51818 R51818 
Ptey CAT number Accession 
5 322175 46877J AF085975H53458H53459 

323011 139750J AA580288 AA315655AA133031 AA377748 
322975 1510563J C16391C16413 

317210 211994.1 AW881 145 AA490718 M85637 AA304575 T06067 AA331991 

323817 233566 J AM10943 AW948953 AA3342D2 AA332882 
10 309583 1046029.-2 AW170035 

324961 376239J AA613792 AW182329 TD5304 AW858385 

303502 325188.1 BE174240 AM88528 AL042253 

320133 447553.1 BE151746BE336853D63271T94955AA774994 

311935 174129.1 AA215387T63548 AA228676 
15 321050 5Q2195J C0592BAW3S34S7 

319977 345248.1 AA534222 AAS32632T81234 

312772 4380 7 AW962299 AA310349 AW982294 H63791 H63751 

321354 11GQ28.-2 AA078493 

336512 CH22_3941FG_834_7JJh»eDJ 
20 336558 CH2^_3992FG_84^JIJJNKJW 

336560 CH22_3994F6.842_5_UhBUM 

329367 cjUa 

3293/3 c_x_hs 

336676 CH2£_4154FG_43_4_ 
25 338008 CH2^6490FG_UWCBfcACOO 

338057 CH2^6558FG_UW^EMAC00 

329655 c14_p2 

336959 CH22J764FG.367J3. 

329899 c15_p2 
30 329960 d6 _p2 

338410 (>t22_7067FG__UNK3tAC00 

338451 CH2?_7124FG_UNK_^tAC00 

338588 CH2^7331RLJJNK^MAC00 

338665 CH2^7438FG_LMeEMAC00 
35 338689 CH2^7464FG_UNJ^BtAC00 

308677 AI761173 

338832 CH22J678FG - _UNKJM246D7 
338980 CH23J859RLJWJ1A59H18 
333009 CH22J33FG_61_1JJNJCEMA 
40 333010 CH2?^FG_61JLUNK W EMA 
333013 CH2Z_237FG_61JJJN)C£MA 
308981 AI873242 
308995 AI880172 

333108 CH22_336FG_79J4_UN>eEM: 
45 333139 CH2?_368FG.83JBJJNK3*: 

333254 CH22J95FGJ18 JUJNJ^t 

333305 CH2?J550FG_137JLUNK-EM: 

333343 CH22J89FGJ39.12JJNKJEM 

333388 (m.634FG_144U3JLWK_0* 
50 326213 c17Jb 

333456 CH2^706FG_157^5JUNK_EM: 

333459 CH2?J09FG.157_8JJNieEM: 

333517 CH22_773FG_173JLlW)eat 

333585 CH2Z_846FGJ»3^4_UNK3t 
55 333679 CH23_941FG^47_6JJNK_0* 

326474 C19J» 

333743 CH2^_1009FG^264_1JJNKB« 
333758 CH2?_1Q24FGJ268JJJMCEM 
333767 CH2£JQ34FGJ271_6JJNK^/I 
60 333768 OC^1035FGJ71JUJNKB« 
333769 CH2^_1036FGJ271JLMNKJai 

333795 CH2^1063FGJ?75JJJNK.EM 

333796 CH22_1065FG^275JJJNK^M 
335004 CHZ2J326FG.472J3 _LiNKJBrf 

65 333892 

335115 CH2?J2447FG„496JLUfleB* 
333904 CH2^_1176FG^294jaiNHLBrf 
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333905 CH22J 177FG_294_3JJNK_EM 
333921 CH22.1 194FG_296J2JJNK^ 
333968 CH2^1245FG_307j4_UNK-EM 
328059 cJJts 
5 335287 CH2^629FG_526J1J4NK_E 
326816 c20_hs 
326617 c20Jb 

335342 CH22J689FG.536J JJNK^I 
335491 CH22J843FG_570J3JJ^ 
10 335495 CH22J847FG_570_^8JJNKJE 
335495 CHZL285*G_571_7JJWea« 
328304 cl7Jb 
3G5453 AA738110 

335544 CH22_2899F6J76J5JJNK_EM 
15 335610 CH2ZJ!963FG_583_4_UWeEM 

335653 CH2i.3013F6_590_4JJNK-EM 

335682 CH23J043RL595JLUMCEM 

335687 CH2W048FG_596JLUWja4 

328492 cjja 
20 335755 CH22.312ffG_604_4JJN)eB« 

335782 CH22^3151FG_609_4JJNK^M 

335791 CH22L316OF6_611J_MWe0iA 

335809 CH2*J181FGJ17JBJJNK_P* 

335822 0<22J195FG_619_JJJW_EM 
25 335823 CH22J196FG_619_8JJNK_EM 

335824 CH2?_3197F6_619J1JJWeE 

335825 CH2£_3198FG_619J2J-WK^ 
3358% CH2^3272R3^635_3_UWeEM 
335917 CH2^J294R3_636_13JJNieE 

30 335920 Ot22J297FG_636_16JJNK_E 

305898 AA072838 

305913 AA876109 

305950 AA884479 

328857 Q_7Jis 
35 330084 c19_p2 

337968 CH2^6419FG_JJNK^fAACOO 

309177 AI951118 

309198 AI955915 

309226 AI9S9897 
40 339352 CH22_8317FG_JJNK,BA354I1 

309279 AI990102 

339373 CH2*3348FG_JJNKJA232E1 
325622 c\4Jts 

334102 CHZL1380FG_327_60JJWUE 
45 3329Z7 CH22_148FG_38JJMeC20H 

332929 CH2SJ5QFG J8JLJJNK.C20H 

332930 CH2^151FG_38j4JJN»eC20H 
332955 CH2^179FG_48J2JJhneEM: 
33358 CH2^182FG_4fiLl5_UNJ^ 

50 332961 CH2^185FG.48_18JJNICEM: 
332983 CH2£J07FG_54_5jJNK^fcA 

334222 CH2^1506FG_360J_UNK_B4 

334223 CTO_1507FGJ360_4JJNie94 
334264 CH2^1551FGJ67J5JJNieE 

55 327110 C21J» 

334343 CH22_16^G_375^25JJNK_E 

334360 CH22_1654FGJ78_5JJNICEM 

327196 <l1Jjs 

327283 OJ» 
60 327313 qJJ» 

304465 AM21948 

304507 AM56426 

327450 Qjjs § 
304591 AA505702 
65 304601 AA507875 
304659 AA533185 

334784 CH22J096FGJ3?_9JJNK_EM 
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334789 CH2W101FGJ3?_14_UNK-E 

334794 CH2^06FGj^JLMNKJM 

336035 CH2SJ420FGJS78JBJJNKJXI 

336042 CH2^.3427FGJ579JUMUW 

5 336093 CH2Z_34B1F6_691JLUNKJXI 

336096 CH2^.3484FG_691_5JJMUy 

334889 CH2^J206FG^45^3JJWeEM 

336150 CH2^_3540FG_708J5JJNKJ)A 

336152 CH2i_3543FGJ06_9JJNKJ3A 

10 336416 CH2^3833FG_823_38JJW^P 

336444 CH22_3864FG_827_10_MNKJ) 

336449 CH2?_3870FG_829 JIJJNKJXI 

336471 CH2^3894FG_829J0JJNKJ) 
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TABLE 13B 

Table 13B shows the genomic positioning for those pkeys lacking unigene ID's and 
5 accession numbers in Table 13B. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 



10 



15 



20 



Ptey. Unique number corresponding to an Eos probeset 

Refc Sequence source. n»7Q^Bm)mberainttteco!u!ro 

emitted The DNAsetpiencerf human chromosome 22." Dunham Let at. Nature (1999) 402:489-495. 
Strand: Indicates DM^ strand from which exons ware predicted. 

NLpostttonc Indbates nudcofldt: positions of predicted exons. 



Pkey Ref 



Strand NLposifion 



332955 
332958 
332961 
333139 
333254 
333305 
333368 
25 333517 
333585 
333679 
333767 
333768 
30 333769 
333795 
333796 



333921 

35 333968 
334102 
334264 
334343 
334794 

40 334889 
335287 
335491 
335495 
335498 

45 335653 
335687 
335809 
335822 
335823 

50 335824 
335825 
336035 
336033 
336096 

55 336444 
336959 
3&008 
338057 
338410 

60 338568 
338665 
338832 
338980 
339352 



Dunham, I. etaL Plus 

Dunham, L etaL Plus 

Dunham, I. etaL Plus 

Dunham, I. etaL Plus 

Dunham, I. etaL Plus 

Dunham, L etaL Plus 

Dunham, I. etaL Plus 

Dunham, L etaL Plus 

Dunham, L etaL Plus 

Dunham, L etaL Plus 

Dunham,!. etaL Plus 

Dunham, L etaL Plus 

Dunham. L etaL Plus 

Dunham, L etaL Plus 

Dunham, L etaL Plus 

Dunham, L etaL Plus 

Dunham, I. etaL Plus 

Dunham, I. etaL Plus 

Dunham, I etaL Plus 

Dunham, L etaL Phis 

Dunham. L etaL Plus 

Dunham,!. etaL Plus 

Dunham, L etaL Phis 

Dunham, L etaL Plus 

Dunham, L etaL Ptus 

Dunham,!. etaL Phis 

Dunham, 1. etaL Plus 

Dunham, L etaL Plus 

Dunham, I. etaL Plus 

Dunham, L etaL Plus 

Dunham. I. etaL Phis 

Dunham, L etaL Ptus 

Dunham,'. etaL Ptus 

Dunham, L etaL Plus 

Dunham, I. etaL Plus 

Dunham, I. etaL Ptus 

Dunham, I. etaL Plus 

Dunham, I. etaL Plus 

Dunham, I. etaL Plus 

Dunham, L etaL Phis 

Dunham,!. etaL Ptus 

Dunham, L etaL Plus 

Dunham,!. etaL Ptus 

Dunham, L etaL Phis 

Dunham,!. etaL Plus 

Dunham, L etaL Plus 

Dunham, L etaL Plus 



25088962508992 

2516164-2516310 

2521424-2521555 

33694954369571 

25214243521555 

46303884630645 

49137494913805 

5570729-5570925 

62347786234894 

70687957068896 

7694407-7694623 

7695440-7635697 

7696625-7696707 

7807688-7807795 

780825*7808319 

81568258157001 

83803258380441 

8681004-8681241 

9995140-9998373 

13234447-13234544 

13655828-13656307 

16374312-16374458 

19288024-19286515 

22298047-22299299 

24128651-24128827 

24140688-24140872 

24172082-24172161 

25329710-25329802 

25445952-25446064 

26310772-26310809 

26364087-26364196 

26365925-26366004 

26376860-26376942 

2637817526378268 

29016748-29017410 

29556922-25557002 

2957B87&-29579047 

3419058534190718 

13233040-13233126 

76970687697236 

8526397-8526522 

19292807*19292916 

22896767-22896920 

2447265424472853 

2777512827775290 

29896789-29896874 

3354478443545121 
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332929 
332930 
332983 
333009 
333010 
333013 
333108 
333343 
333456 
3334S9 
333743 
333758 
333904 
333905 
334222 
334223 
334360 
334784 
334789 
335004 
335115 
335342 
335544 
335610 
335682 
335755 
335782 
335791 
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15 



20 



25 



30 335917 
335920 
336042 
336150 
336152 

35 336416 
336449 
336471 
336512 
336558 

40 336560 
336676 
337988 
338451 



45 



339373 
325622 
329G55 
329899 



50 326213 
326474 
330084 
326816 
326817 

55 327110 
327196 
327283 
3Z7313 
327450 

60 328059 
328492 
328304 
328857 
329367 

65 329373 



Dunham, I. etaL 
Dunham, 1. etaL 
Dunham, I. eLaL 
Dunham, L eLaL 
Dunham, LetaL 
Dunham, L etaL 
Dunham, L etaL 
Dunham, L etaL 
Dunham, L etaL 
Dunham, L etaL 
Dunham, L etaL 
Dunham, L etaL 
Dunham, L etaL 
Dunham, L etaL 
Dunham, I. etaL 
Dunham,!. etaL 
Dunham,!. etaL 
Dunham,!. etaL 
Dunham, L etaL 
Dunham,!. etaL 
Dunham, L etaL 
Dunham, L etaL 
Dunham, I. etaL 
Dunham, I. etaL 
Dunham, L etaL 
Dunham, I. etaL 
Dunham,!. etaL 
Dunham, L etaL 
Dunham, LetaL 
Dunham, LetaL 
Dunham, LetaL 
Dunham, L etaL 
Dunham, LetaL 
Dunham, I. etaL 
Dunham, I. etaL 
Dunham, LetaL 
Dunham, I. etaL 
EXjnham, t etaL 
Dunham, LetaL 
Dunham, LetaL 
Dunham, LetaL 
Dunham,!. etaL 
Dunham. LetaL 
Dunham, LetaL 
Dunham, LetaL 
5887000 
6448516 
6563505 
5091594 
5887224 
5867405 
6015302 
6552458 
6552458 
6117842 
5867446 
5867478 
5867501 
5867766 
6117819 
5868455 
6004478 
6381927 
5668842 
6682537 



Minus 
Minus 
Minus 
Mows 
Minus 
Minus 
Minus 
Mnus 
Minus 
Minus 
Minus 
Minus 
Mnus 
Minus 
Mnus 
Mnus 
mnus 
Minus 
Minus 
Minus 
Minus 
Minus 
Minus 
Minus 
Minus 
Minus 
Minus 
Minus 
Mnus 
Mnus 
Minus 
Minus 
Mnus 

Minus 

Minus 

Minus 

Minus 

Minus 

Mows 

Minus 

Minus 

Minus 

Mnus 

Minus 

Plus 

Mnus 

Minus 

Mnus 

Mnus 

Pius 

Minus 

Plus 

Ptus 

Plus 

Plus 

Mnus 

Mmo 

Mnus 

Ptus 

Minus 

Mnus 

Mnus 

Mnus 



2020758-2020664 

2022565-2022497 

26319334631797 

2766043-2765856 

2766207476611 9 

2772278-2772039 

3240494-3240389 

46928864692753 

2631933-2631797 

514454*5144344 

75732187573060 

76664137666091 

821737*6217261 

82177966217670 

12732417-12732289 

12734365-12734269 

13728850-13728751 

16294548-16294360 

16306095-16305998 

20581911-20581794 

2138825021388146 

22597446-22597284 

246509)524650403 

25068943-25068841 

25421215-25421093 

25763806-25763747 

259(057625908440 

25948563-25948411 

26975307-26975239 

27028481-27028377 

27034927-27034811 

29041694-29041500 

3015042330150256 

30156053-30155870 

3404740634047311 

34204707*34204577 

34215091-34214978 

34278373-34278275 

3437582534375698 

34376814*34376596 

20225652022497 

7095797-7095680 

2017428620174193 

24893073*24692972 

33860127-33860047 

69994-70075 

35565-35843 

111056-111783 

1031-1162 

60751-60927 

16995-18101 

57019^9337 

198354-198436 

199909-200001 

9460654785 

180921-181333 

567-962 

89734-89838 

47928-48076 

37052-37204 

46094-46241 

3884-3952 

80557-81051 

87201-87587 

38950-39301 
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TABLE 14: Table 2 from BRCA 001-5 US 

Table 14, a subset of table 13, depicts a preferred group of genes highly upregulated in breast 
5 cancer cells. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

10 UriganeHX Unfeene number 

UnigeneTtfle: Urijjenegenefitte 

R1: Ratio of tumor to noimal breast tissue 



IS PKey ExAccn UnJGenelD UnlgeneTffle 



20 



25 



30 



35 



40 



45 



50 



55 



60 



100038 
100114 
100975 
101031 
101104 
101143 
101332 
101378 
101609 
102818 
102721 
102817 
102907 
102985 
103060 
103180 
1032TB 
103821 
104115 
104667 
105166 
106103 
107151 
109415 
110189 
110561 
110734 
110915 
111179 
111357 
112134 
113970 
114124 
114292 
114901 
114965 
115652 
115875 
116790 
116921 
117412 
120241 
120325 
121596 
123619 
124006 
125852 
126160 



M97935 

D0O596 

J02923 

J05070 

L07615 

L12723 

L47276 

M13755 

M86849 

U65932 

079241 

U90904 

X06985 

X17644 

X57766 

X69433 

X72755 

AA157623 

M428090 

AA007234 

M191512 

AA4211M 

AAS21169 

AA227219 

H20543 

H59617 

H98714 

N46252 

N67239 

N91Q23 

R46025 

W86748 

Z38595 

Z40715 

AA236276 

AA250737 

AA405C98 

AA433943 

H29532 

H72948 

N26722 

Z41815 

AA195651 

AM16740 

AA609200 

D60302 

K09290 

N90960 



H&62962 

Hs.76506 

Hs.151738 

Hs.169266 

H&90Q93 

Hs.156346 

Hs333 

Hs323733 

Hs31071 

Hs.118666 

Hs33724 

Hs202833 

Hs2707 

Hs.155324 

Hs3337 

Hs.77367 

Hs.198793 

Hs261Q2 

Ha30098 

Hs2BQ05 

Hs.12094 

Hs3687 

Hs.110826 

Hsj6278 

H&5199 

Hs24131 

Hs29724 

Hs.10760 

Hs37128 

Hs.7413 

Hs.8109 

Hs.125019 

Hs.184641 

Hs. 196437 

Hs.72472 

Hs38178 

Hs.43946 

Hs.101174 

Hs321 

Hs.42645 

Hs35948 

Ks.104106 

Hs.174104 

Hs270016 

Hs.76550 

Hs265398 



AFFX control: STAT1 

trr/mkrytate synthetase 

tyrnphocyta cytosoifcptoteto 1 (L-pJasttn) 

matrix rrt8taBoproteinase9 (geLaGna^e B; 9 

Human neuropeptide Y receptor Y1 (NPYY 

heat shock 7(8© protein 4 

Homo sapiens (eel One HL-6) alpha topois 

hterfefCfi-stirmilated protein; 15 kDa 

Homo sapiens connexbt 26 (GJB2) mRNA 

(fldraceBufar matrix protein 1 

Human done 23759 mRNA; partial cds 

Human done 23773 mRNA sequence 

heme oxygenase (decycSng) 1 

G1 to S phase transition 1 

matrix rnetalJoprotehase 11 (stromelysh3 

Goctbate Dehydrogenase 2 (NADP*); rrfe> 

monokine induced by gamma interferon 

K1AAD750 gene product 

ESTs 

ESTs 

Homo sapiens mRNA; cDNA DKFZp564G 

ESTs 

ESTs 

trnudeoGde repeat containing 9 

DKFZP586B1 621 protein 

ESTs; WeaJdy strnfiar to UBIQUTT1M-CON 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs; Highly stmfiar to WAA0886 protein 



ESTs; WeaWy s&nuartoR28660_1; parfiat 

ESTs 

ESTs 

ESTs; WeaWy similar to Weak stmfiarflyt 

nitaotubute-assotiated protein tau 

biglycan 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

Homo sapiens mRNA; cDNA DKFZp564B 
ESTs; WeaHyskriBar totranstormatloiHel 
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R1 

16.7 

153 

30.1 

37.2 

183 

17:4 

183 

18.1 

22.5 

23.2 

15 

15.2 

22.7 

20.6 

173 

185 

15.1 

233 

28.7 

163 

193 

15.4 

19 

20.1 

16.6 

193 

302 

212 

37 

15 

17.4 

15 

22 

19.4 

16.9 

35.1 

16.1 

333 

2Z2 

20.7 

18.1 

153 

152 

223 

23.1 

203 

253 

16.4 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



127S77 AA91G752 

Ifc* vl f fVW lUf Wit 


H&264190 


ESTs; Highly similar to MEM3 [Mmuscul 


17J 


128595 U31H75 


Hs. 152677 


short-chain alcohol dehydrogenase family nt 


27.1 


12B717 T30R17 


Hs.1 04222 


Hofrosap^mRNA;cDNADKFZp5S6L 


243 


129124 AA23453D 


H&1088Q2 


l^^fiiyfaTolslnilojB^GBnsffivG tBctor 


20J 


12936& H18Q27 




olsxtnCI 


182 


130455 X170JB 

Iwt^Kfcf AHUM 


hte. 155956 


N-auMlyniuosfefBSP 1 (arytemlnB M-acetytt 


26.4 


130604 XD3635 


Hs.1G57 


estrogen rcoeptor 1 


395 


1 JU9 Id «f 


rEv£ 1 low 


tmrtolnrsMn nl niitpr irtftrtrhmrl rial mprhhta 


20.9 


130944 MOTORS 


MM HOB 




183 


131472 AA608962 


H&27258 


nalrvrSn hktftfnn ncntefli 


18.1 


131562 U90551 


Hs28777 


H9A hfcftnno famihr mpmhpr 1 


183 


132180 AA405569 


Hs.418 


fthmMad arfluaflnn nmiptn" nlnhn* spivasfi 
uwuuuat duivouuii |huc«I| o\y\ax, oc^jiibc 


15.4 


132406 F09979 


Hs.4774 


COI0 


15 


132465 AA047896 


Hs.49169 


PQTq 
cola 


15.4 


132994 AA505133 


H&279905 




2&4 


133294 R79723 


Hs59997 




30.4 


133634 U24166 


H&234279 


lilMUtuPUIO'q&jiAiiaaM yn\JuiU\, i\r/t-u miii 


15J2 


134374 D62633 


Hs3236 


PCTe 
CO IS 


152 


134405 J04177 


Hs32772 




153 


134470 X54942 


Hs33758 


^ft^*)fl ihirfcii-J^ Llifc^oo O 

IhAVO piuEBal KDiaSB £ 


203 


134495 D63477 


Hs34087 


klAAAiA^ ninhMn 
NAAUI'tJ piUiSin 


1R1 


134714 U89922 


Hs390 


lyrnpfKMOoon osia \inr sujrenanijy, memD 




135237 AA454930 


H&3691 


colS 


195 


301884 AA312082 


Hs.105445 


uuNr tanwy recepiof aipna i 


207 


302276 NM.004448H9L323910 


boi ctust^(rtot n uniuenej won exon nn 


91 fi 

CIJO 


302290 AL117607 


Hs.175563 




41.4 


309177 AJ951118 






243 


309583 AW170035 




cot 


645 


310438 AW022192 H&200197 


ESTs 


39.1 


311166 AI821294 


Hs.H8599 


CO IS 


24.1 


312153 AA759250 


Hs.153028 


cyvxniufTiB D-ooi 


27.1 


313915 AB69390 


Hs.163443 


ESTs 


77 1 
£.1.1 


314506 AA833655 Hs206868 


colS 


97ft 

CI JO 


314558 AI873274 


Hs.190721 


ColS 




314691 AW207206 Hs.136319 


COlS 


21.4 


314943 AW76797 


Hs.184572 


C6I QiViSiOn cyoe c, wi id o anu iuvm 


184 


315196 AA972756 Hs.44898 


ColS 


2&B 


316177 AB08272 


Hs2931Q2 


col uUSIBr \IUE ul Uniuene/ 


325 


318073 AW167087 Hs.131562 


CCTe 
ColS 


15.7 


318662 AI285898 


H&294014 


CCTo 
ColS 


163 


318740 NNL0Q2543HSw77729 


col GUStta (iioi in UluooJioy 


213 


318744 A1793124 


Hs.144479 


Co 19 


35 


319668 NMJX12731H337773 


Col CHibUJI \IRX Ul UIBOtSlC/ 


25.4 


320074 AA321166 Hs278233 


per rhiefpr /nnl tn 1 IntRmf^ 


16J 


320211 ALJ0394Q2 


Hs.125783 


nciiCj; nmtoh 
UCMDO |Hilttfln 


243 


320727 U96044 


KH81125 


col ouaita (inn ui wuuciicy 


153 


322818 AW043782 Hi293616 


PCTq 
cols 


21 


322882 AW248508 H&279727 


UlueO^B SjiNimin Oiuufl icgHHi yoic £ 


153 


324251 AL344891 


H&269350 


col ouaiw \ip* ui m wo ic j 


50.1 


324432 AA464510 Hs.152812 


PCT rh icter / rw* tn 1 hriCaiei 
col wuwta \nui w maopucj 


16.7 


324603 AW016378 Hs592934 


POTq 
ColS 


23.1 


324620 AA448Q21 


Hs34109 


EST duster ( not tn UnJGene) 


212 


324S88 T06997 


H5.121028 


EST duster (not in UniGene) 


245 


330388 XD3363 




HER2 receptor tyrosine kinase (oerbB-2;£ 


177 


330486 M13755 


Hs333 


bTteiferoTHsfimuIated pmtein; 15 kDa 


67 


330814 AA015730 H&265398 


ESTs; Weakly sbnfiartD transfoiimtiun-rel 


44.1 


331145 R72427 


K&129873 


ESTs; WeaWy sfcnDar to CYTOCHROME 


41.9 


331306 AA252079 Hs.63931 


4achsWtd(Drost)phte)rttm)k)g 


151 


331890 AA432166 Hs3577 


succinate dehydrogenase complex; sutomft 


243 


332526 AA281753 Hs.77515 


Inosftol 1;4*^-<nphosphate receptor, type 3 


19 


332532 N63192 


Ks.1892 


EST; Highly similar to FrENYlETriANO 


153 


332694 AA2BZ768 H&243901 


WAA1067 protein 


152 


332958 




CH7JLR3ENESw48_15 


173 


333769 




CH22J=GENES\271_8 


483 


333968 




CH22J=GB*ESV307_4 


153 


334223 




CH22J=GEWEoV360_4 


335 


334264 




CH2?J^ENES.367J5 


185 
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335791 CH2*JGB*ES.611J 

336512 CH22_FGBESi34_7 21.4 

338008 CH22_EMAC005500.GBiSCAN.127-9 15.2 



i 
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TABLE 14A 



Table 14A shows the accession numbers for those pkeys lacking unigenelD's for Table 14. 
5 For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs, These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

10 

Ptey. Unique Eos probeset Henffier number 

CAT number Genedusternumber 
Accession: Genbank accession numbers 

15 

Ptey CAT number Accession 

309583 1046029_-2 AW170035 
20 336512 CHZL3941FG_834_7JJNKJXI 

338008 CH2^6490FG_JJNK-BAAC0O 

333769 CH2^1036FGJZ7UJJNK-B^ 

333968 CH2^1245FG_307_4JJNK_EM 

335791 Cm_3160FG_611jrjJWeEM 
25 309177 A1951118 

332958 OC2J82FG_48_15JJN>eEM: 

334223 Oi2^1507FG_360JJLlW_EM 

334264 CH2?_t551FG_3o7_15JiNK^ 

123619 371681 1 AA6Q2964 AA6092D0 
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TABLE 14B 



5 Table 14B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 14. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Unique number corresponding to an Bos probeset 

Sequence source. The 7 djgS numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et aL" refers to the publication 

ewWedThe DMA sequence of human chromosome 22." Dunham J. et aL, Nature (1999) 402489-495. 
Indicates DNA strand from which exons were predicted. 
Indicates nucleotide positions of predicted exons. 



Pkey Ref Strand NLpostDon 

332958 Dunham. LetaL Phis 251616^2516310 

20 333m DunhanUetaL Pbis 7696625J685707 

333968 Dunham. LetaL Phis 868100*8681241 

334264 Dunham. LetaL Ptus 13234447-13234544 

338008 Dunham, Letat Ptus 7697068-7697236 

334223 Dunham, LetaL Minus 12734365-12734269 

25 335791 Dunham, LetaL Mnus 25948563-25948411 

336512 Dunham, LetaL rVfinus 3427837^34278275 



10 Pkey: 
Ref: 

Strand: 
NLposrOoru 

15 
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TABLE 15: Table 3 from BRCA 001-5 US 
Table 15 shows genes dowmegulated in breast cancer cells. 



5 



Ptey: Unique Eos probeset ktenfflfef number 

ExAccn: Exemplar Accession number, Genbank accession number 

Urogenefl): UnJgene number 

1 0 Unlgene TBe: Unigene gena fife 

R1: Rato ofnoimal breast tissue to tumor 



Pkey ExAccn Un&ene ID Unlgene Tras R1 

100115 000632 Ks.172153 glutat^ peroxidase 3 (plasma) 1.7 

100499 TK3FfcHT1428 1^283108 Olob^Beta 13 

100502 TtGRHT1496 Hs.169228 AAera*$pec&Pi0teiriPg2 23 

100815 TJGRCT4268 HslS739 ^GJyceroWtosp^ 1.7 

20 101125 L10373 Hs.82749 transmembrarK^superfarrfym 13 

101387 M12963 Hs.4 akx^ dehydrogenase 1 (class I); alpha po 23 

101397 M15656 Hs.180878 Gpoproiein fipase 13 

101883 1108399 HsJ5613 CD36 antigen (cofegen type I receptor, thr 1.6 

102227 U25138 Hs.93841 pofcassuimlara^conAjcta^ 1.6 

25 102857 X00129 H&76461 reftw*b&Kflng protein 4; tatersfifial 3 

103211 X73079 Hs288579 pdyinerfchnmunop^fau^ 1.8 

103496 Y09267 Hs.132821 ^amtaWnginmixygenase2 13 

103562 Z21966 Hs2815 POU domain; dass 6; lmreaip8onf^ 13 

104672 AA007629 gJyceioWitaphate dehydrogenase 1 {sol 2.4 

30 105083 AA146619 Hs.18791 ESTs;WeaMysknQarto(^IUMWD 17 

105138 AA164519 Hs.15248 ESTs 13 

106075 AM17915 Ha25930 ESTs 13 

106870 AA487576 Hs^6530 serum deprivatarespor^fjjhospha^ 13 

107099 AA609645 H&211568 eukaryofc transit toffiato 2.7 

35 107616 AA004901 H&261164 ESTs 13 

107997 AA037388 Hs32223 Human DMA sequence from clone 141H5o 1.7 

1083)4 AA099820 Hs.49696 ESTs 2.4 

111130 N64265 Hs.19515 yz44M2*1 Ntorton Fetal Cochlea Homosa 1.7 

111837 R36447 Hs.24453 ESTs 13 

40 112538 R70255 ESTs 1.9 

112808 R97970 Hs^81022 EST 13 

113086 T40652 H&208100 DKFZP434C171 protein 13 

115740 AA418033 Hs.283559 ESTs 13 

115949 AA443800 Hs.43125 ESTs 2 

45 1159S5 AM46661 Hs.173233 ESTs 22 

117224 N20300 H&218707 ESTs 1.7 

117513 N32174 Hs.44317 SRY(sex^Brrnlr^rEgwiYH)ox 10 1.7 

119059 R15436 Hs77869 Frfeo^aiaxtaregk*igeneX123 17 

119175 R71792 Hs3010Q2 ESTs; WeaWy simfer to celt deaft acovato 23 

50 119359 T71Q21 . Ha285681 ESTs;tWsirrtotoV^todc4^elb(-too 13 

119798 W73386 Hs349129 ESTs 3 

120889 AA365784 Hs37044 ESTs 13 

121381 AA405747 Hs37984 ESTs; WeaWysim3artoWASP-tamDy pro 13 

121750 AA421184 Hs37549 ESTs 13 

55 122127 AA434447 H&106771 ESTs 23 

122348 AA443685 Ha293410 ESTs 2.1 

122485 AA448300 Hs.160318 phosphdeiranan 13 

123443 AA598841 Hs.167382 natfureflc peptide receptor AJSguanybtocy 13 

.123505 AA600135 ESTs; Moderately sknBar to llll ALU SUB 13 

60 125284 W94688 Hs.103253 peripto 17 

126300 D81972 HUM427DT^HunianfeybratonFupw 13 

126747 R72515 Hs.160318 ptospridernman 13 

127218 AA309765 Hs.116017 ESTs;Wea^sfrn8artoKlAA0795protel 13 

127357 AA452788 Ks.75432 zx39g1tr1 Sc^JrtaUetosJfo2HF8_9 17 



257 



WO 02/059377 



PCT/DS02/02242 



127638 AA634405 Hs.122608 ESTs 1.5 

128213 AA972780 Hs.129194 ESTs; WeakJyslmaar to till ALU SUBFA 13 

128351 AI092391 Hs.134886 ESTs 13 

128842 N44757 H&20340 ESTs 16 

5 128870 R71403 Hs.75309 eukaryoSctranstatoe(ongafk)nfa^2 1.7 

129146 AA459944 Hs.108324 DKFZP586P1422 protein 13 

129285 T62068 Hs.11006 ESTs 2.1 

129331 N93465 Hs779772 ESTs; Highly simDar to CGI-38 protein [H 15 

* 130085 M62402 H&274313 teufcttte growth bcfr 1.7 

10 130400 M25079 Hs^83108 bemogtoWn; be!a 17 

131267 AA211776 Hs2504 myomesln 1 (stetemln} (185kD) 33 

131277 AA131466 H&23767 ESTs 19 

131282 M12272 Hs.4 alcohol dehydrogenase 3 {cbssOigarmnap 22 

131304 AA295848 H&S475 aquaportn7 17 

IS 131810 D49487 Hs.194236 tepfin (murine obesity homotog) 23 

132788 AA045503 K&56874 ESTs; WeaWysimfer to Homo sapiens p2 16 

132931 Z41452 H&6Q90 tmad ^bMdefts^chwmoson^iGg} 13 

133120 X64559 Hs35424 tetranectin (pbsmhcx^ 2 

133314 U95367 Hs.70725 ganro^ni^tyncacid{GABA) Arecep 13 

20 133507 X74295 Hs.74369 mtegrti;aJoha7 17 

133601 S95936 Hsl284176 transferrin 23 

133702 N56898 Hs.75652 glutathione Stonsferase US 19 

134111 N79674 Hs3022 TU3A protein 4.6 

134699 U56814 Hs38646 deoxyAonucfease Uto 3 15 

25 134749 L10955 H&89485 carbonic anhydroe IV 16 

135173 M72885 Hs35910 Hi0r^GOS2protehgene; cornptetecds 13 

300132 AW027556 Hs.156286 ESTs 17 

300732 AJ369956 Ha»7891 ESTs 13 

300750 AA514B05 rte293055 ESTs 13 

30 301140 AI807692 Hs.129129 ESTs 1.6 

3013% AA923549 H&224121 ESTs 2.1 

302910 N77976 H&251577 hemoglobin; alpha 1 18 

303798 V00505 H&36977 rremogtobft aefta 16 

303831 T04868 Hs.46780 EST duster (r)drnUraGerte) with exon hit 17 

35 303844 U94362 K&58569 grycogenin2 13 

304182 H91086 EST singleton (not In UniGeneJwrlhexon 13 

304622 AA516384 ESTst^on(nx4inUniGene)wrBiexon 13 

304682 AA550994 ESTsmgtetDn(notnUniGene)w^e»m 17 

305612 AA782347 H&272572 ESTsl^^(nothUniGene)wffliexon 13 

40 306193 AA923457 EST singleton (not in UriGene) wflh exon 13 

307206 AI192534 EST singleton (not in UniGene) wffii exon 13 

307377 AB22691 EST singteton (not hUnKenejwfih exon 13 

308023 AI452732 H&251577 EST s^leton (not friUnfGene)w8h exon 19 

308359 AI612774 Hs.79372 retinoid X receptor, beta 13 

45 3)9838 AW296073 H&255504 EST 13 

310403 AI720978 Hs.148006 ESTs; Ktoderately similar to artemafively 3 13 

311671 AW241947 Hs332478 ESTs 13 

311794 AW238092 Hs254759 ESTs 2.1 

312082 T79860 Hs.118180 ESTs 13 

50 312575 H25237 H&306814 ESTs 23 

313076 N49684 Hs.143040 ESTs 13 

313283 W32480 Hs.157099 ESTs 22 

313374 AW328672 Hs.132760 ESTs 13 

314701 AI754634 Hs.131987 ESTs 1.7 

55 315391 AA759088 Hs.192007 ESTs 13 

315688 AA680G55 Ka264885 ESTs 13 

316249 AA948612 Ks.130414 ESTs 13 

316588 M205077 Hs294085 ESTs 1.7 

316890 AA837079 H&24647 ESTs 13 

60 316983 AM802M Hs.177131 ESTs 13 

317604 AI650625 Hs300756 ESTs 13 

317951 AW205520 Hs.129621 ESTs 13 

319400 W26902 Hs.154085 ESTs 1.7 

320757 H22654 Hs.6382 EST cluster (not in UnrSene) 13 

65 321594 AA021402 Hs.11067 ESTs 17 

322102 H45569 EST cluster (not In UnKSene) 13 

322814 A1824495 Hs*11038 ESTs 22 
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322929 AI365585 Hs.146246 ESTs 23 

323831 AA335715 Hs.200299 ESTs 1.7 

324044 AL045752 H&22330 ESTs 13 

324675 AW014734 Hs.157989 ESTs 22 

5 32S272 . caiUiS9P669Q2 15 

325556 CH.12JtsgP56302 15 

325656 cai4Jsgi|G056305 15 

326120 CR17_hsgP67194 15 

326139 Cai7Je g!p867203 15 

10 326855 Ca20Jegl]65524G0 15 

327438 CaOLhsg56004454 15 

329733 CK14jJ2gP)65783 15 

330931 F01443 Hs384256 ESTs 45 

331591 N71677 Hs.42146 ESTs 15 

IS 332158 AA621393 Hs.112984 EST 15 

332364 W94688 Hs.103253 perifipin 2,1 

332502 H21819 Hs.14896 Homo sapiens dona 24590 mRNAseQuenc 15 

334175 CH22_FGENES549_10 15 

334347 CH22JGENES375J\ 13 

20 334737 CH22JGBIES.424J2 13 

335352 CH22JGBJES539_5 15 

335639 CH22JGENES584J9 15 

336244 CH22J=G0CS.746J2 15 

336336 CH2^FGENES314ja 17 

25 336865 CH22J=GENE&305-1 15 

337494 CH2JLFGENESJ99-12 15 

337764 CH22^tAC000097.GENSCAN.119-1 13 

337983 CH22.BAACO055OOJGB4SCAN.11O-1 2 

338192 CH22_EMACO(S500X5ENSCAN^28-1 15 

30 339366 CH22_BA354l1ZGBlSCAN34-2 15 
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TABLE ISA 



Table 15A shows the accession numbers for those pkeys lacking unigenelD's for Table 15. 
5 For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs, These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Ptey CAT number Accession 

126300 250375^2 D81972 BE003132 
20 112538 504579J M908813R70255 

123505 genbanfcJ\A600135 AA600135 * 

104672 6735_7 AA34909B A1368018 F21390 F17759 R48772 AW21485 AI300352 H43971 AI378525 F33652 R47898 AI264177 F22289 N28263 



AIZ76281 R48205 AI245302 A1190036 AI281050 AW242903 H42892 AA910870 AW473816 H25721 AW451438 F19647 F22375 
H45809F33447M774528AA007629W2537C01077F32386 



25 322102 46708.1 H45589 H13807 AF075038 H19808 H42437 

336865 CH22_4590FG_305_1_ 

338192 CH2£_6755FG _JJN)^EMAC00 

329733 c14 j>2 

326120 c17_hs 
30 326139 c17_hs 

326855 c20_hs 

335352 CH22J699FG.539 JJJNKJEM 

335639 CH22L2999FG_584J9JJNK.E 

307206 AI192534 
35 307377 AI222691 

337494 CH22_5727FG_799J2_ 

337764 CH2^6115FG_JJWeQtAC0O 

337963 CH22_6438FG_MNK_BAAC(K) 

339366 CH2^833©=G_UN»eBA354n 
40 325272 dljB 

325558 c1ZJ» 

325656 c14J» 

334175 CH2*J455FG_349_10JJNKJ 

304182 H91086 
45 334347 CH2^.1640FG_375_31JJ)«eE 

327438 CJU» 

304622 AA516384 

334737 CH22J049R5_424 

304682 AA550994 
50 336244 CH2i.3642FG_746JULINKJW 

306193 AA923457 

336336 CH22_3746FG_814_8_UNK_BA 



10 



Pkey. 

CAT number 
Accessions 



Unique Eos probeset identifier number 
Gene duster number 
Genbank accession numbers 



15 



260 



WO 02/059377 



PCT/US02/02242 



TABLE 15B 



10 



15 



25 



30 



35 



Table 1SB shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 15. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 



Pkey. 
Ret 

Strand: 
NLposfflon: 



Ptey Ret 



Unique number corresponding to an Eos ptobeset 

Sequence source. TteTdb&iiuntetshthbcolun^ TnmhamLelaL'rB^tothepiibCcaaon 

en^TheDNAsequaiceofhun™ Dunham L et aL, Mature (1399) 402:489495. 

Indicates nucteotkle positions of predicted exons. 



Strand NLposiflon 



334347 Dunham, L 
334737 Dunham, L 
20 335639 Dunham, L 
Dunham, t. 
Dunham, L 
Dunham. L 
Dunham, L 
Dunham, I. 
Dunham, L 
Dunham, I. 
Dunham, L 
Dunham, L 
Dunham, L 



337494 
334175 
335352 
336244 
336336 
336865 
337764 
337983 
338192 
339366 
325272 
325558 
325656 
329733 
326120 
326139 
326855 
327438 



6056302 
6056305 
6065763 
5867194 
5867203 
6552460 
6004454 



etaL Plus 
etal Ptus 
etaL Ptus 
etaL Ptus 
etal Minus 
etal Minus 
etaL Minus 
etaL bffinus 
etaL Minus 
etaL Mnus 
etaL Mfaus 
etaL Minus 
etaL fctinus 
Minus 
Ptus 
Mnus 
Ptus 
Ptus 
Minus 
Minus 
Minus 



13663814-13663926 

15998517-15998685 

25173591-25173896 

33339024-33339148 

11668659-11668597 

22661512-22681384 

3140272^1402583 

33797209-33797076 

862240S8622289 

4035640-4035446 

7275495-7275271 

13248453-13248277 

33647431-33647293 

13247-13312 

7093071030 

7819078707 

163237-163450 

3611636276 

218901-218960 

111390-111463 

199569-199692 
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TABLE 16: Table 4 from BRCA 001-5 US 

Table 16, a subset of table 15, depicts a preferred gr6up of genes highly downregulated in 
breast cancer cells. 



Pkey. IWqueEosprobesettdenfferaimber 

ExAccn: . Exemplar Accession mmiber.GonbankacoesslcmmBriber 
10 Urdgenett>. Un^ara number 
Unigene Title: Unigene gene fitte 
R1: Ratio of normal breast fissue to tumor 



15 



20 



25 



30 



35 



40 



45 



Pkey ExAccn 


UmGenelD 


Unigene Tife 


R1 


100902 T1GFWT1496 


Hs.169228 


Adrenal-SpecfficPiotesiPg2 


2.3 


101367 M12S63 


Hs.4 


afeohot dehydrogenase 1 (class I); alpha 


2,9 


102857 X00129 


Hs.76461 


ititiiKit-blhdffg jmuIbIii 4; interstitial 


3 


104672 AA007629 




gSycerot-3-ptasphatede)^ 


Z4 


107099 AA609645 


rte-211568 


eufcaryofc transfefan (nifiaflon factor 4 gam 


Z7 


108604 AA09982O 


Hs.49696 


ESTs 


2A 


115949 AA443600 


Hs.43125 


ESTs 


2 


115965 AA446661 


Hs.173233 


ESTs 


2.2 


119175 R71792 


Hs3010Q2 


ESTs; Weakly snnQar to ceB dearth activator 


2.8 


119798 W73386 


H&249129 


ESTs 


3 


122127 AA434447 


Hs.106771 


ESTs 


ZS 


122348 AA443695 


Ksl2S3410 


ESTs 


2.1 


129285 T62068 


Hs.11006 


ESTs 


2.1 


131267 AA211776 


Hs^504 


myomeski 1 (sketemm) (185kD) 


33 


131282 M12272 


Hs.4 


alcohol dehydrogenase 3 (class 1); gamma 


22 


131610 D49487 


Hs.194236 


tepfin (rnunreobes^homobg) 


2.5 


133120 X64559 


Hs.65424 


tetranectin (ptasmfexigerKhbsfing protein) 


2 


133601 S95936 


Hs384176 


transferrin 


Z3 


134111 W79674 


Hs£Q22 


7U3Aprotek) 


4.6 


301396 AA923549 


Hs224121 


ESTs 


2A 


311794 AW238092 


Hsl254759 


ESTs 


2A 


312575 H25237 


H&306814 


ESTs 


22 


313283 W32480 


H3.157099 


ESTs 


21 


322814 A1824495 


Hs211Q38 


ESTs 


22 


322929 AI365585 


Hs.146246 


ESTs 


Z3 


324675 AW014734 


Hs.157969 


ESTs 


22 


330931 F01443 


H&284256 


ESTs 


4.6 


332364 W94688 


Hs.103253 


perflpki 


2.1 


337983 




CH22^BtACOO55OO.G0ISCAN.1ia-1 


2 
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TABLE 16A 

Table 16A shows the accession numbers for those pkeys lacking unigenelD's for Table 16. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Ptey: Unique Eos probeset fctertffierrairnber 

CAT number Gene duster number 
Accession: Genbank accession numbers 

Pkey CAT number Accession 

104672 67357 AA349096 AJ368018 F21390 F17759 R48772 A1421485 AJ30G352 H43971 AJ378525 F33652 R47898 AI264177 F22289 N28263 
AI27B281 R48205At2453Q2 AJ190036 AI281050 AW242903 H42892 AA910870 AW473816 H25721 AW451438 F19647 F22375 
H458C»B3447AA774528AAOO7629H4S37Cai077F^386 
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TABLE 17: Table 1 from BRCA 014 P 

5 Table 17 shows accession numbers representing 759 sequences of breast cancer genes or 
fragments thereof encoding breast cancer modulating proteins. For each overexpressed gene 
identified, a ratio of the relative amount of expression in breast tumors versus nonnal breast 
tissue. 



10 

Ptey: Unique Eos proteset Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnJgenelD: Umgene number 

Unigene Title: LMgenegenetffle 

15 R1; Rafio of tumor to norma] breast tissue 



Pkay ExAccn UmgeneD Undone Title 



R1 



00227 
00405 
00406 
00420 
00911 
00360 
101011 
01183 
01194 
01329 
01378 
01474 
[01491 
01530 
01602 
01663 
101758 
101767 
01817 
01851 
01878 
02209 
102214 
02297 
02299 
102301 
02305 



AV654694 

AW291587 

AI962060 

D86983 

X83300 

J00124 

BE387036 

AA442324 

L20971 

U66042 



102591 
02721 
02739 
102791 
02804 
02903 
03010 
03042 
03117 
03207 
I03282 
I03284 



I03364 
03385 
03456 



R07566 

M25809 

M29874 

AA353776 

NMJ03528 

BBJ19494 

M81057 

M89907 

BE260964 

M97815 

NM_002038 

U23752 

NMJD01504 

NM.005824 

NH.005651 

AL0432Q2 

U39840 

U62325 

H16646 

AA363025 

AF08Q229 

NM.002318 

M73779 

X525Q9 

T81656 

X63578 

X72790 

BE390551 

A1751601 

XB5134 

X90872 

NM^007069 

AA496425 



HsA2316 

Hi82733 

Hs.118397 

Hs.118893 

Hs289103 

Hs.1 17729 

Hs.1211 

Hs.735 

Hs.188 

Hs£2171 

H&833 

Hs.73817 

Hs.64173 

Hs.1360 

KsJOl 

Hs2178 

Hs.79217 

Hs.180884 

Hs.152292 

H&82D45 

Hs.183650 

HsJ26SJ27 

H&32964 

Hs.198252 

Hs.155545 

Hs.163671 

Hs.90073 

Hs299867 

Hs324125 

Hs.118666 

Hs.155572 

K&83354 

Hs2505Q5 

Hs.161640 

Hs252259 

H&295449 

HsJ7628 

HsJ375 

Hs.72984 

Hs279929 

Hs37l89 

Hs.9629 



interferotvlndoced, hepatitis C-associat 
nMogen2 

AE-bfofing protein 1 
Melanoma associated gene 
SMA4 

keratin 14 (epidermolysis buttosa simple 
add phosphatase 5, tartrate resistant 
H2A historte famfty, member O 
phcsphocfiesterase 4B, cAMP-specffic (dun 
Homo sapiens done 191B7 placenta expr&s 
tntBTferjCfKS&nulated protein, 15 kDa 
small Inducible cytokine A3 (nornotogciB 
ATPass, H+transportng, lysosomal (vacu 
cytochrome F450, subfemly HB (phenobar 
CQ48 antjgen (B-ce* membrane protei)) 
H2B histone family, member Q 
pyrrcfine-5-carboxytetB reductase 1 



SWITSNF related, matrix associated, acfi 
mkfldne (neur2& grcwftvprocriafing factor 
cefluiar retinoic add-bhdirtg protein 2 
Interferon, a^ha-lndudbte protein (do 
SRY (sex detem*iiig region Y>box 11 
G protein-coupled receptor 9 
37 KDa leodne-rtch repeat (LRR) proteh 
tryptophan 2,3-cDoxygenase 
diromosonte segregation 1 (yeast homobg) 
hepatocyte nuclear factor 3, alpha 
arnytoW beta (A4) precursor proteavbW 
rrypofoefical protein PP991 
Human done 23801 mRNA sequence 
gW-tuman endogenous retrovirus K done 1 
Iysytax>dase^lke2 
retinoic add receptor, alpha 



ribosomal protein S3 
parvalbumin 

gcd-fuman endogenous retrovirus mRNA for 

steroidogenic acute regulatory protein r 

TNF receptor-assodated factor 4 

retncMastoma4Axflng protein 5 

gp25L2 protein 

similar to rat HREVf 07 

paptDary renal cefl carcinoma (transtoc 

264 



3 

32 
3.6 
32 
52 
4.3 
3 

32 
3 

4.1 
5.3 
3.9 
4.5 
9 

3.4 
5.6 
3.6 
12 
32 
4.1 
6£ 
3 
3 

3.7 
3J 
52 
35 
3.9 
4 

3-5 
32 
3 

32 

3.3 

1£4 

4.5 

3 

5.9 
33 

a3 

3.1 
3 

3.4 
32 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



103558 
103563 
103612 
103825 
104073 
104103 
104115 
.104168 
104173 
104181 
104189 
104269 
104307 
104518 
104556 
104658 
104748 
104755 
10482 
1048% 
104865 
104906 
104961 
105036 
105088 
105092 
105093 
105304 
105397 
105409 
105431 
105552 
105598 
105650 
105688 
105808 



105909 
105965 
106135 
106184 
106233 
106400 
106474 
106484 
106533 
106614 
106636 
106661 
106743 
106844 
106864 
106865 
106871 
106942 



107105 
107158 
107248 
107265 
107630 
107710 
107890 
107985 
108000 
108217 



Y09306 

BE616547 

102911 

BE336654 

AI571835 

AW779318 

AW021102 

AF183810 

AA461618 

AA084273 

AF173296 

AB040S27 

AI559444 

AI929700 

H2Q816 

AVGS0851 

AA360954 

AA015879 

T49951 

AA035613 

AW294092 

T79340 

BE298684 

H78517 

AW503733 

H585B9 

AA148982 

AL137566 

AW134924 

AA814807 

AW505076 

AA252033 

AA256750 

AAZ79439 

W16741 

AG99139 

AI133161 

AW973653 

AA195191 

AA131657 

AL117474 

W28948 

N39842 

BE397649 

BE383668 

AA351978 

All 34708 

AA648459 

AW958037 

AW499914 

BE613328 

AA485055 

AB11928 

AW192535 

AW4729B1 

AA995351 

AF216751 

AW963419 

N32849 

AW263124 

BE379594 

AW961576 

A1955040 

AAQ25386 

T4Q064 

A1263307 

AA0586B6 



Hs30148 

Hs2785 

Hs.150402 

Hs.70937 

H&55468. 

Hs.88417 

H&21509 

H3261Q2 

Hs31704 

Hs.76561 

H&283740 

Hs301804 

Hs^93960 

Hs.111680 

Hs.112423 

Hs£6900 

Hs^7268 

Hs33536 

H&9029 

Hs.141883 

Hs^1594 

Hs^2575 

HSJ268Q2 

Hs33905 

HsS414 

Hsl35156 

Hs29068 

Hsl32405 

Hs.190325 

Hs.7395 

Hs301855 

H&242413 

Hs*28802 

Hs279763 

H&25635 

Hs.17517 

Hs386131 

HsJSHM 

Hs5111 

H&23830 

Hs.41181 

Hs.10762 

Hs301444 

Hs£4109 

Hs.42484 

Hs.4943 

Hs.145998 

H&335951 

Hs286 

Hs.7579 

Hs21938 

Hs.158213 

Hs.19479 

Hs321130 

Hs31314 

Hs£68l3 

Hs.155223 

Hs31B44 

Hs315111 

Hs.49136 

Hs£0178 

H&265388 

H&31311 

Hs.71968 

H&239884 

Hs£2588 



homeodofnaWnteractfng ptotetn kinase 3 
keratin 17 

AcSvh A receptor, ^pe I (ACVR1)(ALK 

H3 hfetone family, member A 

ESTs 

ESTs 

ESTs 

opposite strand to trichoTtitrtophaianQeai 
ESTs, Weakly slmflar to WAA0227 pisap* 
ESTs, WeaWy sinter to S47072 finger pr 
DCS protein 
K1AA1494 protein 
ESTs 

endosuffine alpha 

Homo sapiens mRNA; cDNA DKFZp586l1420 (f 
hypoftefcaJ proten; WAA1830 proteh 
Homo sapiens cDNA: RJ21933 fis, done H 
ESTs 

DKFZP434G032 protem ' 
ESTs 

hypothetical protein MGC15754 
B^CU^nphcinffl6,rf)ernberB(ztocfi 
protein kinase domains containing protei 
ESTs 

K1AA1438 protein 

Homo sapiens cDNA aJ1 1 027 Bs, clone PL 
ESTs 

Homo sapiens mRNA; cDNA DKFZp586G0321 (f 
ESTs 

hypothetical proteh RJ23182 

DiGeorge syndroms crffcai region gene 8 

hypothetical prateh 0KFZp434K1421 

centaurirKityha 2 protein 

hypoftetk^profehFU10504 

HSPCWJ3protesf> 

ESTs 

CGMOIprotefo 
bypc^etkd protein RJ00052 
hypotheticai protein FU20729 
ESTs 

Homo sapiens mRNA; cDNA DKFZp727C191 {fr 

ESTs 

WAA1673 

Homo sapiens cDNA RJ13634 fis. done PL 
hypomefical protein FU10618 
hepatoceflularcafdnoma associated prot 
ESTs 

hypothetfcal proteh AF301222 
ribosomal piutext L4 
hypothetical protein FU10402 
hypo&teScaJ protein FLM2492 



0fcqo89hO4 Jtl NCLCGAP_KW5 Homo sapiens 
ESTs 

hypomefcal protein MGCZ771 
refinobtastema-bindbig protein 7 
CDA14 

startniocatti2 
hypotheScaJ proteh HJ125B6 
mroton3ce^aHepressomMDAC3corTip 
ESTs, Moderately sfrrrffer to ALU7_H UMAN A 
ESTs 

ESTs, Weakly simfer to bansfoimafiofH 
ESTs, WeaHysfrnBarto S10590 cystene 
Homo sapiens mRNA; cDNA DKFZp564F053 (fr 
H2B hfctone famDy, member L 
ESTs 



3.4 
37 
3.2 
4.5 
4 

3.8 

4.3 

7.6 

3.6 

4 

3 

3.2 

43 

3.1 

32 

4.4 

32 

32 

AS 

6.9 

11.1 

3.5 

6.5 

3.6 

4.5 

3.8 

3 

4.8 
8.2 
3.1 
4.2 
4.4 
3.2 
35 
3.7 
5.5 
3.5 
3.3 
3.2 
33 
32 
33 
4.1 
3.1 
12 
7.8 
3 

33 
33 
3 

A2 
3.4 
4.4 
35 
4.1 
3.6 
53 
3.4 
3.1 
53 
33 
AS 
3 

3.1 
4.8 
33 
33 
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108435 TB2427 Ha.194101 Homo sapiens cDNA: FU20869 fis, done A 3 

108591 AB033073 Hs.43857 simfer to gluccsamlre^tfatases 33 

108733 AA121Q22 gtezn84f10j1 Stratagene tung carcinoma 33 

108771 AF068290 Hs.79741 hyrjofoefcaJ protein RJ10116 6.1 

5 106819 AA011449 H&2716Z7 ESTs 3.6 

108912 AA136674 Hs.118681 EST 3.9 

109086 AF186114 H&270737 tumor rieorosb factor (figand) superferrd 3.7 

109124 AKD00684 Hs.183887 hypc)G)8§caJ protdh RJ22104 3.1 

109132 AI970536 Hs.16603 hypothetical proteki FU131B3 3.7 

10 109163 N23235 , Hs3Q567 ESTs,WeaWysWartoB34087h)fpo« 45 

109277 AA196443 Hs36043 Homo sapieiB cDNA RJ13558 fe, clone PL 3.7 

109410 AW504732 H&21275 hypoftrfcaJ protein FU1 1011 4.6 

109454 AA232255 Hs.295?3? ESTs, Moderately sirnSar to A4S010 X-En 6.4 

109514 AA234087 Hs^62346 ESTs, WeaWy similar to S72482 hypofoeti 4.8 

IS 109581 R45584 H&23Q25 EST^WeaWysbitotoALUS.HUMANALU S 33 

108632 AA325138 H&235873 hypothetical protein HJ22672 3 

109644 AW973964 H&291531 ESTs, Highly similar to 1203217A dehydro 3 

109700 F096Q9 gfcHSC33H092 normafized infant brain cON 3.2 

109768 FD6838 Hs.14763 ESTs 32 

20 109807 R43646 Hs.12422 ESTs 3.8 

109842 AW818436 H&23590 sotote canter famHy 16 (nwwcarboxyfc 33 

109895 AKD01660 Hs30488 DKFZP434F091 protein 3.6 

110024 AW973152 Hs31050 ESTs 4.2 

110561 AA379597 Hs3199 HSPC150 protein simSar to ubtquErwon 5.1 

25 110675 H89355 H&249159 adrenergic. alpha^A-, receptor 53 

110707 A1239832 Hs.15617 ESTs, WeaWy SBrflar to AUMJWMAN ALUS 3.7 

110915 BE092285 Hs^9724 hypofireflcal proteh RJ13187 3.7 

111139 N64683 H&290943 ESTs 4 

111155 N66563 Hs.191358 ESTs 3.1 

30 111199 AI767435 H&29822 ESTs 4.5 

111336 AW57338 HsJ29894 ESTs 5.4 

111510 R07856 Hs.16355 ESTs 3.2 

111532 R08440 gtryf19fD9^1 Soares fetal Bver spleen 3.1 

111689 AA6Q2004 H&23260 ESTs 32 

35 111823 R35252 H&24944 ESTs, WeaWy similar to 210926QABceB 33 

111676 R38239 H&293246 ESTs, Weakly similar to putative p150 [H 3.1 

111892 AA421081 Hs.12388 ESTs 3.4 

111893 AF070526 Hs.13429 Hcrno sapiens ctone 24787 rnRMsec^errc 33 
112125 AW379029 Hs.1 18338 ESTs, WeaMysfrnBar to unnamed protein 4.4 

40 112170 BE246743 H&288529 hypothetical protein FU22635 73 

112287 AB033064 Hs334806 WAA1238 protein 32 

112300 H24334 Hs^6125 ESTs 4.4 

• 112303 R54797 gtKyg87b07^1 Soares infant brain 1N1BH 3.4 

112478 R66067 Hs^8664 ESTs 8J2 

45 112561 AI791493 H&129873 ESTs, WeaWy simflar to A3OT36 cytochTom 53 

112631 RB2040 gb.~rfisb06.s1 Soares placenta Nb2HP Homo 3.9 

112637 R82331 Hs.164589 ESTs 5.4 

112657 AW844878 H&.19769 hypofoeficai protein MGC4174 32 

112678 AI418466 Hs33665 ESTs 47 

50 112917 AA082465 Hs.125031 dwBne/ethand^^ 3.7 

113070 AB032977 H&6298 WAA1151 protein 3.1 

113095 AA8283B0 Hs.126733 ESTs 3.4 

113117 AW813731 Hs.159153 ESTs, Moderately sfmflar to S65657 alpha 3.4 

113187 BE613410 Hs31575 ^C63,enA)pbsnriciBfiaihOTitra 32 

55 113200 T57773 Hs.10263 ESTs 33 

113206 BE262470 HsJ241471 RNB6 02 

113374 179925 H&2691G5 ESTs, WeaWy stmBar to ALU1_HUMAN ALU S 37 

113440 U54727 Hs.191445 ESTs 3 

113494 T91451 Hs36538 ESTs 3.4 

60 113518 AW367788 Ks323954 postmetoScsegregaSon increased 2-flte 3.1 

113571 AI7Q2609 Hs.15713 rr/pofiietteai protein MGC2776 3.1 

113822 NKL004585 Hs.17466 retook; add receptor responderflazaro 19 

113835 AB12410 Hs27475 Homo sapiens cDNAFU12749 fe, done NT 3 

113938 W81598 gfczrt8Bg02*1 SoaresJetaUieanJWHISW 4.6 

65 113947 W84768 gfczh53dQ3.8l Soares JetaUver spteen. 3.1 

113970 WZ7249 Ks3109 hypodietfcalprotofriFU21080 6L9 

114086 AA378776 Hs288649 hypothetical protein MGC3077 43 
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AW470411 
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AI733881 
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Hs388433 

Hsw267596 

Hs.10S469 

H&82226 

Hs.72472. 

Hs.193657 

H&88155 
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Hs^89038 

Hs36693 

H&109221 

Hs334827 

Hsw55158 

H&88143 
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Hs.70202 
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H&2S0646 
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H&83484 
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H&166254 
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HsS0081 
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Hsl55209 
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H&287820 

H&560D9 

Hs.43670 

Hs£4859 

Hs.194695 

Hs.119571 

t&59529 
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Hs31145 

K&292581 

Hs301872 

H197579 

KsJ7387 

Hs.196029 

HsJ7697 

Ks.189095 

H&301570 



neurotrfrrtfn 
ESTs 

suppressor of vafl (Sxerevfsfee) 

glycoprotein (transmembrane) nmb 

BMP-RIB 

ESTs 

ESTs 

ESTs, ^gWysfrito to A55713 Inositol 



hypofoefcal protein MGC4126 

ESTs 

ESTs 

ESTs 

ESTs, Weakly sJmDar to T29520 hypoftefi 
ESTs 

ESTs, Wealdy stmOar to ALU7J1UMAN ALU S 

ESTs 

ESTs 

riypothedcal protein FU21816 
ESTs 

WAAQ966 protein 

hypoffieficai protein FL113852 

Homo sapiens mRNA; cDNA DKFZp727C191 (fr 

poJy(A^specffic rtoonudease (deadenyta 

NMJM 6931 Homo sapiens NADPH 0)ddase4{ 

WD repeat domain 10 

CGU»proteh 

ESTs, Moderately simitar to A46010 X* 
EST3,Moderatriys!ntotoALU8_HUMANA 
bacubviral IAP rerjeafcxmtaWng 6 
LBPproteto32 

SRY (sex o^termtohg region Y>bax 4 
hypothetical protein RJ10252 
gfcns44»5*1 NCLCGAP^AM Homo sapiens 
ESTs,htoderate^sirnaartoZN75_HUMANZ 
ESTs, WeaWy simiar to 176885 serhefth 
Homo sapiens mRNA fid length insert cDN 
ESTs 

DKFZP434N014proteh 
hypomefcal protein DKFZp566I133 
hypothetical protein FU21802 
FKBP-associated protefn 
ESTs 

Horr»osap^rnRNA;cDNADKFZp564B182(fr 
MAA1199 protein 

ESTs, Moderately stotBarto ALU8JRJMAN A 

!4c<r«sapierismRNA;cDr^DKFZp 

chemokhe binding proteh 2 

ESTs, Moderately stmBar to AUIIJftlMAN A 

fiironecSn 1 

2^-dfooademfete synftetase 3 (100 k 

WrtesmfamBy member 3A 

ESTs 

ras homoJog gene famBy, member I 
cc&agen, typa Dl, alpha 1 (Ehters-Dard 
ESTs, Moderator/ statttartoAUOUMANA 
ESTs, WeaWy sfmSar to S65824 reverse t 
rr/pofoetJcaJ protein RG083M05\2 
gbcnc21d06j1 NCLCGAPJYI Koroo sapiens 
ESTs, Moderately shtotoALUIJiUMAN A 
hypometical protein MGC4840 
ESTs, Weakly sfctiar to A46010 X-Onked 
ESTs 

Homo sapiens mRNA far WAA1657 protein, 
EST 

stmSartoSALM (sal(Drosoph9a)-!ike 
ESTs 
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121351 AW206227 Hs£87727 rrypoftefcaJ protein FU23132 5 

21611 M31669 Hs.1735 inhlWn, beta B(adMnAB beta polypep 16 

[21643 AA640987 Hs.193767 ESTs 53 

21770 NMJM5902 Hs278428 progestin induced proteh 3.4 

122125 AK00O492 Hs388GS - hypofl^etfcal protein 4.1 

I22338 AA443311 Hs3899B ESTs 3 

22417 AA44G965 Hs.1 12092 ESTs 4.7 

122513 AI787879 Hs39214 ESTs 33 

J22544 AW973253 Hs^92689 ESTs 3 

10 122655 AA323296 . Hs37837 Homo sapiens mRNA; cDNA DKFZp547J047 (fr 5.6 

122806 AA526911 Hs32772 collagen, type XL alpha 1 3.2 

22851 AW2Q5931 Hs39598 hypofte&ca) protein MGC5338 8.6 

123105 AA487809 Hs.166011 catenh (cadnertKSsodated protein), d 3 

.23111 AA22S776 Hs.191721 ESTs 6.9 

15 123249 AA371307 Hs.125056 ESTs . 3.6 

I23273 AA491253 Hs.173611 Emptncafiy selected item AFTC single pr 7 

23385 BE14S685 Hs.17767 WAA1554 protein 3.1 

23419 T66087 Hs.1 12482 Homo sapiens unknown mRNA sequence 3.4 

J23485 A1308876 Hs.103849 r^pofi)e^protdnDJ<FZp761D112 3.1 

20 123645 AI675944 Hs.188691 Homosapfensd^ FU12D33fe, ctonehfi 33 

23819 AA560082 Hs.1 12264 ESTs 4.7 

24012 AA352723 HsJ241471 RNB6 33 

124243 H69125 Hs.133525 ESTs 4.1 

,24357 N22401 gkyw37g07.s1 Morton Fetai Ccchtea Homo 4.1 

25 124359 M22506 Hs.139315 Homosapfens cOHA: FU21479 fis, done C 33 

124667 AW451645 Hs.151504 Hc^ sapiens d)MRJ1 1973 fcs, done HE 3.1 

124911 M34151 Hs.174195 k^t^kit^transmriManspf^ 33 

24972 R413S6 Hs.101774 hypoMcalprotehFLJ23045 43 

.25008 BB065136 Hs.145696 spSdng factor (CC13) 6 

30 125042 T78906 H&269432 ESTs, Moderately staflar to ALU1.HUMAN A 8.1 

125184 W60326 Hs288684 Homo sapiens cDNAFUl 1750 ketone HE 4.7 

125243 AW970536 Hs.105413 ESTs 3.1 

86 AF086534 Hs.187561 ESTs, Moderator/ sfrrtiar to ALU1 J1UMAN A 33 

J25304 AL359573 Hs.124940 GTWjhdhg proteh 3 

35 125330 AW880562 Hs.114574 ESTs 3 

125331 AJ422996 Hs.161378 ESTs 3.2 

85 AI924630 Hs.4943 hpf^tocftftiriar carcinorna associated prot 3.2 

N99638 gb2a39g11j1 Scares fetol fiver spteen 4 

J26474 AW975814 Hs326714 Homo sapiens clone WAGE:713177, mRNA se 4 

40 126666 AA648886 Hs.151999 ESTs 33 

26872 AW450979 gbAJU«l^ala^-12-0^t3l NCI_CGAP_Su 3 

127431 AW771958 Hs.175437 ESTs, Modera^ similar to PC4259ferri 3.6 

27980 AA981459 Hs.125644 ESTs 4.1 

J27S97 AW068311 Hs311054 Honiosaptonsri^UlenpJhbsertcDN 33 

45 1M420 AA65Q274 Hs.41296 fiyor^leuctierichUar^ignibrarep 43 

09 NM_003616 Hs.102456 surv^al<3fniotorr)eujonprotolntotorac 33 

128946 Y1 3153 Hs.107316 kyrwrente 3-rrcnooxygenase {kpwrensie3 3.1 

128955 AA775076 Hs.185807 Homo sapiens, Skr^ to PRO0478 protein 33 

129092 D56365 Hs33S25 pc^rC>btoc%ig protein 2 33 

50 129270 AA357185 Hs.109918 tashofr^amfian%,nien^ 3.1 

129301 AF182277 Hs330780 cytochron»F450»subfa^ 33 

129385 AA172106 Hs.1 10950 Rag C protein 6\2 

129619 AA209534 Hs^84243 telraspanrCHprotefc 3.4 

129629 AKD00398 Hs.11747 hyp«ffte^pr^ai20391 3 

55 129725 X56411 Hs.1219 akshol dehydrogenase 4 (class II), pip 3J2 

130069 AJ754813 Hs.146428 coQageatypeV.a^hal 5.4 

130092 X03363 Hs323910 v-erb^ avian eryfttrobJa^ 4.4 

130296 AJ347487 Hs.132781 dass I cytokine receptor 4.6 

130382 NM-003450 Hs.155204 zmc finger proteto 174 53 

60 130622 A1582291 Hs.16846 ESTs, Weafysirnflarto04HUD1 debrisoqu 3 

130703 R77776 Hs.18103 ESTs 33 

•130881 AA809875 HsJ25933 ESTs A2 

130954 AB014544 Hs^1572 KIAA0644 gene product 47 

131095 AB99653 H&22917 ESTs 43 

65 131153 H09048 HsJ2380S ESTs 33 

131253 R718Q2 Hs34853 ESTs 33 

131372 AW293399 Hs.144904 rwdear receptor co-repressor1 33 
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131507 AU26268 1&277G9 ESTs, Weakly simfer to MCATJiUMAN MTTOC 3.2 

131587 A1695549 Hs.183868 gtucuronk^se, beta 3.1 

131739 AF017S86 Hs31386 secreted MzzlB(Me)ated protein 2 3.2 

131795 BE501849 Hs32317 h^Hnobi% group 20B 3.2 

5 131970 D86960 Hs3610 HOAA0205 gene product 3.6 

131986 NM002314 Hs36566 UM domain kinase 1 32 

132093 AA400091 Hs39421 ESTs 3.2 

132122 AA426202 Hs.40403 Cb^p30O%iterBC&ig batsacfivator.wft 3 

132159 076435 Hs.41154 Zk fatty mmixrl (attpatoDrcsc^ Z2 

10 132333 AA192669, Hs.45032 ESTs 33 

132406 AL133731 Hs.4774 Homo sapiens mRNA; cDNA DKFZp761 C1712 (f 9.2 

mm AV660345 H&238126 CGM9proteh 82 

132543 BE568452 H&5101 protein regulator of cytokinesis 1 3.4 

132624 AA326108 Hs33829 bHLH protski DEC2 32 

15 132700 AA319233 . Hs3521 ESTs 43 

132725 NKL006276 Hs.184167 spBcing factor, afgtntoeJsertoi*7 3.6 

1327S9 W73311 Hs.169407 SA(^(suppiessc<()racfoinuta6ons2,y 32 

132847 T48195 Hs38189 etftaiyotfctrartiafonh^^ 33 

132857 Y0Q272 Hs.184572 cefldMstoncycte2. G1toSandG2to 4.4 

20 132936 AL120659 Hs.6111 aryUiydroarbon receptor rtucfeerbans! 43 

133130 A1128606 Hs35b7 zhc finger protein 161 33 

133142 AW952412 Hs35874 ESTs, WeaUy similar to A40348 EJav/Sex- 33 

133167 A W1 62840 Hs3641 IdriestotarnBy rrernber5C 43 

133225 AW600291 Hs3823 hypoftefical protoh FU10430 33 

25 133274 AA085191 Hs3949 rrypotheticai protein MGC11275 3 

133275 Z93241 t&239934 CGI-96 protein 43 

133287 AW797437 Hs.69771 B-factor , propetdfo 4.1 

133376 BE618768 Hs.7232 acetyKWTineArarboRyteatiha 5.1 

133462 AW675064 Hs.73875 fanrarytacetoaceiate hydrolase (fumarybc 3 

30 133740 AW162919 Hs.170160 RAB2, mernberRAScfKXjgenetarnQy^e 3.4 

133831 BE274552 Hs.76578 protein tnhlbftorof defeated 6TAT3 33 

133976 AI908165 Hs.169946 GATA-Nrefing proteto 3 (T-cefi receptor 62 

134666 BE391929 Hs3752 trans me rrfrare protein 4 3.1 

134710 AI433797 Hs3889 serine hyftoJcynBttiyfodifeferase 1 (sokib 3 

35 134731 D89377 Hs39404 rnsh 0>osoph8a) torr^ 53 

134776 ^55B2 Hs39603 much 1, tran st nernb rai m 4 

135230 AF0648O4 Hs36757 suppressor of Ty(Sx»evrsiae)3hoinoto 32 

135303 R61253 H&98265 K1AA1877 protein 33 

135400 X78592 Hs39915 androgen receptor (diiydrotestosteroner 4.8 

40 135411 L10333 Hs39947 re&utonl 33 

300089 AH99738 Hs206275 ESTs, WeaWysfrnDartoALUAJ<UMANlin 33 

300233 AW614220 Hs.189402 ESTs A2 

300254 AW183618 Hs35610 sc^caniertomi}y30(zirictramport 93 

300256 AW591433 Hs298241 Transmeritrane protease, serine 3 43 

45 300378 Z45270 Ks£35873 hypofiwficaJpn>tBmFU22672 3.4 ? 

300973 AA572949 H&207566 ESTs 33 

301111 R10799 Hs.191990 ESTs 33 

301341 AA887801 H&203229 6 protelrKoupled receptor 133 

301548 AKB1631 H&203845 tworx)repotasskmicharmelKT33 4.4 

50 301884 AA312082 Hs.105445 GDNFfamHy receptor afchal 5.7 

301936 U79745 Hs.1 14924 solute carrier tarnBy 16 (rrorncarboxyfc 8.6 

301976 T97905 gteye54d0jr1SoeresfetalDtfersptoen 33 

302001 ABQ20711 HsJ278346 K1AA0904 proteto 7.7 

302067 BE542706 Hs 222399 CEGP1 proteto 73 

55 302094 AW749321 Ha.6786 ESTs 33 

302099 A1049670 Hs*137576 rftosomal potato L34 pseudogene 1 42 

302145 MUXB813 Hs.151407 canlagetoteniealBterayerrxoteHnu 73 

302235 AL049987 Hs.166361 Homo sapiens mRNA; cDNA DKFZp554F1 12 (fr 5.6 

302276 AW057736 Hs323910 HER2 receptor tyrostoe kinase (c-erb-o2, 5.4 

60 * 302290 AA179949 Hs.175563 rtorrwsapiererr^cDMDKFZ^^ 34.1 

302372 AL1 17406 Hs^fJ0102 ATP-btodtog cassette transporter MRP8 6.7 

302378 AL109712 H&2S6506 Homo sapiens mR^ftiltef^ insert cON 4 

302384 AI678059 H&202676 syriaptonemalcorrptex proteto 2 43 

302385 AJ224172 K&204096 lpci^B(utBrc«k]Unton^n)eiiM 133 
65 302680 AW192334 Hs38218. ESTs 9.6 

302830 AKB8997 Ks.132921 ESTs 5 

302857 AF282265 Hs.44836 inner cenfromere protein arrfigens (133© 3.4 
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302892 AW176909 Hs.42346 ofcfceurinJ)^ 3.4 

302370 W05608 Hs312679 ESTs, WeaWy simttar to A49019 dyneto he 5.1 

30371 AA652667 Hs36151 HumanDNA sequence torn ctaneRPM 10367 37 

303289 AL121460 H&272673 hypothetical protein FU205D8 4.1 

S 303357 AW006352 Hs.159643 ESTs, Weakly similar to T32554 hypoftefi A2 

303540 AA355607 Hs309490 ESTs, WeaHy similar to putative WHSC1 p 43 

303563 AA367699 Hs.10082 potassium Werrredbte/sr^ 13 

303642 AW299459 gtas50dD8Ji1 NCLCGAPJQdH Homo sajfen 4.2 

303780 AI424014 Hs.18995 WAA1304 parte*) 33 

10 303797 AW629759 gtah70e05.y1 NCLCGAPJ3U1 Homo sapiens 43 

303852 R53434 Hs30207 hypo9ieficalpcDte)nMGC11138 3.7 

304328 AA149951 Hs.62112 rinc fingerprotefo 207 3 

304782 AA582081 gtann32h0ls1 NCLCGAP Gasl Homo sapiens 4.1 

305913 AA876109 gfcnx24h01.s1 NCLCGAPJ5C3 Homo sapiens 3 

1 5 305917 AA876469 gb»e48M4.s1 NCLCGAP _Pr25 Homo sapiens 3.1 

307010 AI140014 gtwja68«»Jt1 SoaresjBtaU>earLNbM19W . 15 

307041 AI144243 gfcqb85b12jc1 Soan3sJstaU)eajtNbW19W 33 

308106 AI476603 gfrtj77e12jr1 SoaresJ^F JBjM_OT_P\J > ^S 43 

308307 A1581398 Hs.172928 cofiagen, type I, alpha 1 4.6 

20 308615 AK000142 Hs.101774 hypotheflcal|m>teinFU23045 4.4 

309177 AB51118 H&326736 Homo sapJensbreast cancer antfcenNY-BR 17.3 

309328 AW024348 Hjl233191 EST, WeaWyslrnQar to A27217 glucose tr 12 

309574 AW16B083 gb:xg59g04jc1 NCLCGAP JJtf Homo sapiens 11 

309583 AW170O35 Hs326736 Homo sapiens breast cancer antigen NY-BR 57.6 

25 310064 A1199712 Hs.146486 ESTs, Weakfy sfrrflar to 191721QA Pro/An} 43 

310098 AI685841 Hs.161354 ESTs 16 

310438 AW022192 HsJ20Q197 ESTs 43 

310683 AI939456 Hs.160670 ESTs 12 

310727 AKD00703 Hs323822 Homo sapiens mRNA for KIAA1551 protein, 33 

30 310781 AB80797 Hs.158992 ESTs 10.2 

310895 A1955121 Hs.165724 f^ace^lactosanune^to^ 3.4 

310955 A1476732 Hs£63912 ESTs * 103 

311117 AJ671439 Hs.196029 Horno sapiens rnRNA for WAA1657 protein, 11 

311166 A18210O5 Hs.1 16599 ESTs 103 

35 311237 AA641098 H&208809 ESTs,MooeratelysimBartoAUJ1JiUM^ 43 

311465 AI758660 Ha206132 ESTs 4.4 

311587 AI828254 H&271019 ESTs, Weakly similar to A47582B-ce8gr 5.1 

311598 AW023595 K&232048 ESTs 53 

311774 AA700870 Hs.14304 ESTs 33 

40 311785 AHJ56769 Ha.133512 ESTs 19 

311872 R12375 Hs.194600 ESTs 33 

311889 AA767342 Hs.122483 ESTs, Weakly simfer to PSF.HUMAN PTBAS 3 

311913 AB58522 H&270188 ESTs 3 

311923 T60843 Hs.189679 ESTs 53 

45 311935 AA216387 gfcnc16b(KLs1 NCLCGAPJ>r1 Homo sapiens 52 

312019 AA373630 Hs.188750 ESTs 3 

312021 AA759263 Hs.14041 ESTs 3.4 

312067 T78968 Hs.14411 ESTs 15 

312090 T80177 Hs.1 18054 sMar to ralrextoubio^ous casein 33 

50 312147 AI633744 Hs.195648 ESTs, Weatysimfer to 138022 hypofliefi 4.4 

312153 BE261944 Hs.118625 howkhasel 5.2 

312168 T92251 Hs.198882 ESTs 33 

312182 T94344 Hs32S263 ESTs 33 

312187 AA700439 Hs.188490 ESTs 14 

55 312199 AW438602 Hs.191179 ESTs 19 

312219 H73505 Hs.1 17874 ESTs 4 

312226 AA315703 Hs.199993 ESTs, Weakly similar to ALUBHUMAN »U 43 

312299 AA972712 H&269737 ESTs 5.7 

312544 AA516420 Hs.183526 ESTs, WeaWyamfer to 138022 hypoftefl 63 

60 312638 AW439195 Hs256880 ESTs, WeaWy shnter to S65657 alpha-1 C- 43 

312826 AW291545 Hs.185018 ESTs 43 

312837 AW292286 H&255058 ESTs 4,4 

312980 AM97043 Hs.115685 ESTs 11 

313070 AM22023 Hs.161338 ESTs 43 

65 313079 K76497 Hs.1787 prieoty*) protein 1 (Pellzaeti&Werzbac 33 

313089 AF026944 H&293797 ESTs 53 

313096 AW073310 Hs.163533 Homo sapiens cDNA FU14142 fis, clone MA 43 
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313126 AA7455Q3 HsJ283313 ESTs 10 

313166 AI801098 Hs.151500 ESTs 3.5 

313197 AW979008 HsJ222487 ESTs 3.3 

313280 AW960454 Hs£22830 ESTs 47 

5 313325 AJ420611 Hs.127832^ ESTs 3.4 

313328 AW449211 Ha.105445 GDNF family receptor afohal 1Z4 

313352 AW150945 Hs.144758 ESTs 4.1 

313385 AKB2087 H&269819 ESTs 3 

313393 AB74685 HsJ200141 ESTs S2 

10 313417 AA741151 . Hs.137323 ESTs 15 

313434 W92O70 gfczh48g05 j1 SoaresJetal>ef^pleefi_ 3.7 

313569 A1273419 Hs.135146 hypothetical protein RJ13984 .3 

313591 AA048309 gb2f12f01.s1 SoarcsJetaLhearLNbHH19W 53 

313615 AI540978 Hs301997 hypothetical protein FU13Q33 3.2 
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314043 AA8Z7082 Hs^91872 ESTs at 

20 314078 AW129357 Hs329700 ESTs 83 

314097 AA648744 HsJ269493 ESTs 6.6 

314121 AI732083 Hs.187619 ESTs 6.2 

314129 AA228366 Hs.115122 ESTs 4 
314138 AA740616 gfeny97f11.s1 NCLCGAPJ3C81 Homo sapiens 53 

25 314236 AA743396 Hs.189023 ESTs 3.1 

314244 AL03645O Hs.103238 ESTs 4 

314305 AI280112 Ks.125232 Homo sapiens cDNA FLJ13266 fis, clone OV 8 

314306 AI697901 Hs.192425 ESTs 3.7 
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30 314394 AW961597 Hs.130816 ESTs, Moderately sfrnfiarto 138022 hypot 42 

314401 AK60412 Hs234557 ESTs 33 

314465 AA602917 Hs.156974 ESTs 4.7 
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314547 AA399272 Hs.144341 ESTs 6.7 
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314627 AA425310 Hs.155766 ESTs, Weafdy slmflarto A47582 B-cel gr 4.4 
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314864 AW9711S8 Hs^94088 ESTs 43 

45 314881 A1095087 Hs.152299 ESTs, Mooerately simBarto S6S>57 afcha 17 

314882 AA828032 Hs.189076 ESTs 11 

314981 AW972359 Hs^93334 ESTs 3 

315006 AJ538613 H&298241 Transmembrane protease, serine 3 105 

315021 AA533447 Hs312S89 ESTs 53 

50 315051 AW292425 Hs.163484 ESTs 123 

315060 AA551104 Hs.189048 ESTs, Moderately similar to ALUCJflJMAN ! 53 

315073 AW452948 Hs257631 ESTs 42 
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315175 AIQ25642 Hs.152530 ESTs 6 
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315196 A067347 HsA4898 HornosapfensdoraTCCCTAO^ 12 
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315240 R38772 Hs.172819 myelh trarealpfcn factor 1-!te 14 
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315368 AB037745 Hs.104696 K1AA1324 proton 4.7 

315397 AA218940 Hs.137516 fidgeWtel 11 

65 315489 AB78817 Hs.191847 ESTs 11 

315498 AA628539 Hs. 116252 ESTs, Moderate* similar to ALU1J1UMAN A 32 

315526 AI193043 Hs.128685 ESTs, WeaWy similar to T17226 hypotheti 4.1 
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CH25LDJ32ttO.GENSCW123 39 

CH22J=GENES343_20 

CKZLQulAC005500.GENSCAN^6 1 

CH2ZJGB1ES3ML2 

CH2?_FGENES.619_4 

CK16Jisgi|5867087 

EST duster (not In UniGene) 

CH2^.BAAC005500.GENSCAN.149 9 

OC2_a««»05500.GENSCAN.421 5 

CH22_F6B*E&134 

CH.07Jtsgt|6004478 

CH22JGENES360J 

CH22J=GBKS3_2 

CHZLC20H1 2.GENSGAN. 1 6 2 

CfrttLCG5E1.GENSGAN.81 

ESTs; WeaHy sim2ar to da Gofcl matrix 

CH22_FGBES307J 

CH2^VtAC005500.GENSCAN^48 14 

CR06J*gi|5902482 

CH2*J=GB*E&669_5 

CH22JU32l10.GENSGAfi19 8 

CH2?JGENES327J6 

CH22J=GENES330_10 

CH22_FGBIES.142 

ESTs 

CH22_FGB<ES^2B7 
CH22JGENES.133 
CH2^B*AC005500.GB4SCANJCB 12 
CH22_FGBE&271_3 



4.1 
4.1 

CoDagen, Type Via. Alpha 1 4.1 

4.1 

4 

3d 
3d 
33 

Guanosto 5* Monophosphate Synthase 

33 

33 

33 

33 

3.7 

Oncogene RetfPtc2, Fusion Activated 

3.7 

3.7 

3.7 

as 

35 
33 
3.4 
3.4 
3.4 
13 
33 
33 

Dna Bhcflng Proten Ap 2, Aft. Soft* 3 

33 

32 

32 

3.1 

3.1 

3.1 

3.1 

3.1 

ai 

3.1 

3.1 1 
3 
3 
3 
3 
3 



33 



3.7 



33 
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TABLE 17A 

Table 17A shows the accession numbers for those pkeys lacking unigenelD's for Table 17. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAsi These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



Pkey. Unique Eos probeset Wentffier number 

CAT number Gene duster number 
Accession: Genbank accession numbers 



Pkey CAT number Accession 



116845 393481J AA649530AA659316H64973 
103207 30635.-4 X72790 

126257 182217.1 N99638 AW973750 AA328271 H90994 AA558020 AA234435 N59599 R94815 

102791 371 86 J AF08Q229 AF08O231 AF080230 AF080232 AF08Q233 AF080234 BE550633 A1636743 AW614951 BE467547 AI580833 
AI633818 N29986 U87592 U87533 U87590 U87591 $46404 U87587 AA463992 AW206802 AI97Q376 AJ583718 AI672574 
N25695 AW665466 AIB1832B AA1261 28 AI480345 AW013827 AA248638 At214968 AA204735 AA207155 AA20G262 AA204833 
AW003247 AW496808 A1D80480 AI631703 AI&51023 A1867418 AW818140 AA5025CO AI206199 AI671282 AI352545 BE501030 
AI652535 BE465762 AA20S331 AW451866 AA4710B8 AA206342 AA204834 AA206100 AW021661 AA332922 N66048 
AA703396H32278 AW139734 HS2683 U87589 U87595 H69001 U87534 BE466420 A1624817 BE466611 AI206344AA574397 
AA348354 AM93192 

126872 142696.1 AW450979AA136653AA136656AW4 19381 AA984358 AA492073 BE168945 AA809054 AW238038 BE011212 BHJ11359 
BE011367 BE011368BE01 1362 BE01 1215 BE011365 BBD11363 
1746257J R82040R70934 
176835.1 AA225084AA3Q2713 

324239.1 AB11928AA936Q30T51931 AA609816AA487195AA664207 
genban)LF096G9 F096D9 
oenban)LR08440 R08440 



112631 
120742 
106864 
109700 
111532 
113938 
113947 
124357 
108733 
112303 
322136 

321811 
314648 

322521 
322675 
323332 
316186 
322975 
324261 
X23817 
301976 
324961 
303642 
303797 
319551 
311935 
319834 
319977 
314138 
313591 



genbanOV81598 W81598 
genbankJV84768 W84768 
genbanUC2401 N22401 
504187J AA121Q22AA126422 
genbanXJ«4797 R54797 
46802.1 AF075083H52291H52528 
47334.1 W76326AF086341W7230O 
1527481J D80630D80896D80895 
293660J AW979268 AA878419 AA431342 AA431628 
38916J T55958757205AFW7346 
38917.1 AF147347 T55426 755503 
86787 1 AA017656AA017374AA019761 

179142J AI829520 AI791832 AA228414 AI791823 AA22S211 AA229315 
425440 1 AM33540AA728984AA804981 
1510563J C16391C16413 

273265 1 BG069341AW748403AU)44891AI908240AA393080 

233566 1 AM10943AW948953AA334202AA332882 

128835 J T97905AA101672 

376239.1 AA613792 AW182329 T05304 AW858385 

284260.1 AW299459 AA417112 

386364.1 AW629759 AW749955 AA633408 A1651005 

357371J AA761668 AA573621 R92814 R09670 

174129.1 AA216387T63548AA228676 

112523.1 AA071267 T65940 T64515 AA071334 

345248 1 AA534222 AA632632 T81234 

179960J AA740616 AA654854 AA229923 

103087.1 AA046309AC63500AA046397 
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. 308106 AM76803 
338255 CH2^.6856FG_JJNJeEMAC00 
335809 CH2^_3181FGJ517.6JJNK-EM 
335824 CHZL3197FGJ>19J UlNK^E 
S 307010 A1140014 
307041 AI144243 
305913 AA876109 
305917 AA876469 
309574 AW168083 
10 325372 cISJtt 
325644 c12J»s 

332798 CH2^14FG_6J5JJNKJ*G1.G 
334223 &V2J\5Q7FGJ60JJ1HKJBA 
327075 c21Jts 
15 334447 CH2Z_1746FG_387JJJNK-EM 
304782 AA582081 

313434 441798 J W92070AW019952W92053 



if 
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TABLE 17B 

Table 17B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 17. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed 



10 



15 



20 



25 



Ptey: 

Strand: 
Nlposffiom 



Ptey Ref 



Unkpis number cofffispondircj to an Cos protest 

Sequence source. Tr»7dprHm*>ersfothi3cota "DunhamLetal" refers to ftepubllcafion 

enfiUedThe DMA sequence of human chromosome^ Dunham Ketal. Nature (1999) 402:48*495. 
Indicates DNA strand turn which axons were predicted, 
tntfcates nucleotide positions of predicted exons. 



334447 Dunham, L 
335809 Dunham, L 
335824 Dunham, L 
332798 Dunham, L 
334223 Dunham, L 
338255 Dunham, I. 
325372 5886920 
325544 6682452 
327075 6531985 



Strand 

etaL Ptus 
etaL Ptus 
etaL Ptus 
etai ^nus 
etaL Minus 
etaL Mnus 
Minus 
Ptus 
Plus 



NLposftlon 

14308764-14308824 

26310772- 2631090 9 

2637^6026376942 

232147-231974 

12734365-12734269 

15242294-15242231 

1117081-1117304 

171228-171286 

40413184041431 
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TABLE 18: Table 2 from BRCA 014 P 

Table 18 shows genes with atleast five times the expression in breast tumor tissue than is 
5 expressed in normal body tissues. 



Ptey: Unique Eos process! Identifier number 

ExAccrc Exemplar Accession nurnber.Genbank 

10 UnfoeneD: Unfoene number 

Unpens Ttfle: UnigenegenetaJe 

R1: Ratio of tumor to normal body tissue 



IS Pkey ExAccn UnigenetD UnigeneTffle 



20 



25 



30 



35 



40 



45 



50 



55 



60 



101378 
101530 
101787 
101878 
103010 
104115 
104825 
107105 
108819 
112287 
112561 
112637 
113206 
113970 
114985 
118925 
119905 
121611 
129301 
133976 
134731 
300254 
301884 
3Q20O1 
302067 
302276 
302290 
302372 
302385 
309177 
309583 
310781 
311166 
311935 
312153 
313328 
313915 
314097 
314138 
314506 
314558 
314691 
315006 
315021 
315051 
315060 



BE563085 

M29874 

M81057 

M97815 

X52509 

AF183810 

AA035613 

AW963419 

AA011449 

AB033064 

AI791493 

R82331 

BE262470 

W27249 

AI733881 

N922S3 

AW449064 

M31669 

AF182277 

A1908165 

089377 

AW183618 

AA312082 

AB020711 

BE542706 

AW057736 

AA179949 

AL117406 

AJ224172 

AI951118 

AW170035 

AI380797 

AI821005 

AA216387 

BE261944 

AW449211 

C18863 

AA648744 

AA740616 

AA8336S5 

AI873274 

AW207206 

AI538613 

AA533447 

AW292425 

AA551104 



Hs333 

Hs.1360 

Hs.180884 

Hs.183650 

Hs.161640 

Hs56102 

Hs.141883 

Hs.155223 

HS571627 

Hs334806 

Ks.129873 

Hs.164599 

H&241471 

Hs3109 

Hs.72472 

H&206832 

Hs.119571 

Hs.1735 

H&330780 

Hs.169946 

Hsl89404 

Hs35610 

Hs.105445 

Hs278346 

Hs522399 

H&323910 

Hs.175563 

H&200102 

Hs504096 

Hs326736 

Hs326736 

Hs.158992 

Hs.118599 

Hs.118825 
Hs.105445 
Hs.163443 
Hs569493 

Hs506868 
Hs.190721 
Hs.136319 
Hs.298241 
Hs312989 
Hs.163484 
Hs.189048 



iriterferon-sftnulated protein, 15 kDa 

cytochiome P450, subfamily IIB (phenobar 

carbopcypepfidaseBI (fisaue) 

cellular rBttoofcackJ-bto^ 

tyrosine ajninotrartsferase 

opposite stand to trWxxhlnophalangeal 

ESTs 

stanrtocalrin 2 
ESTs 

K1AA1238 protein 

ESTs.Wea^amBartoA36036cytochrorn 
ESTs 



R1 

53 
9 
12 
63 
12.4 
7.6 
6.9 
53 
6.1 
73 
8.2 
5.4 
6.2 
6.9 
10.1 
197 
8.4 
5.6 
6.2 
6.2 
53 
9.9 
5.7 
7.7 
73 
5.4 



hypothetical protein FU21080 
BMP-RIB 

ESTs. Moderately sfrnOar to ALU8_HUMAN A 
collagen, type 111, alpha 1 {Ehters-Dani 
mn&Jn, beta B (adMn AB beta polypep 
cytochrome P450, subfarcjy IB (phenobar 
GATA-bindlng protein 3 (T-cel receptor 
msh (Drosophila) homeo box hornotog2 
solute carrier ferity 30 (zinc transport 
GDNF fan% receptor alpha 1 
WAAQ904 protein 
CEGPIpnrteh 

HER2 receptor tyrosine kinase (c-enb-^ 
Homo sapiens mRNA; cDNA DKFZp564N0763 (134.1 

ATM)Whg cassette transporter MRP8 6.7 

GpophOin B (uterogbbm famSy member) 133 

Homo sapiens breast cancer arrfioenNY-BR 173 

Homo salens breast cancer antigen NY-BR 57.6 

ESTs 105 

ESTs 10.8 

gjfcnct6b(&s1 NCLCGAPPrl Homo sapiens 5J2 

hexoldnasel 55 

&NF family receptor alpha 1 1Z4 

Homo sapiens cDNA RJ11576 fis, done HE 263 

ESTs 6.6 
9txny97ftU1 NCLCGAP.GCB1 Homo sapiens 5.9 

Homo sapiens CONAFU14056 fis, done HE 83 

ESTs 27.4 

ESTs 20.7 

Transmembrane protease, serine 3 103 

ESTs 53 

ESTs 123 

ESTs, Moderately similar to ALUCjiUMAN I 5.6 
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315186 A1367347 Hs.44898 Homo sapiens done TCCCTA00151 mRNA sequB.2 



315530 AW015415 Hs.127780 ESTs 83 

315634 AA837085 H&22Q585 ESTs 63 

316012 AA764950 Ha.1 19898 ESTs 7 

5 316177 AJ904S82 Hs293102 . ESTs, Moderately sfmfaf to ALULHUMAN A 30.7 

316560 AA938198 Hs.146123 po4y(A) potymerase gamma 9.4 

317803 AW664964 Hs.128899 ESTs 6.1 

317881 AI827248 H&224398 Homo sapiens cDNARJ1 1469 fis, done HE 9.6 

318740 NKL002543HS.77729 odd^lowdens^Spcfvo^Oecfo 73 

10 318744 AI793124 Hs.144479 ESTs 173 

320211 AL039402 Hs.125783 DEME-6 proteto 9.2 

321107 AI732643 Hs.144151 ESTs 123 

321644 AW975944 Hs£37396 ESTs 117 

321978 N77342 H&21851 Homo sapiens cDNA FU12900 fis, done MT 5 

15 322035 AL137517 Hs334473 hypothetical protein DKFZp56401278 19 

322766 AW0688Q5 Hs288467 Hon» sapiens CDNAFU12280 8s, done MA 5.2 

322818 AW043782 H&293616 ESTs 7.6 

322975 C16391 gbeC16391 Clontech human aorta potyA mRN 165 

323262 AL1 33990 Hs.190642 ESTs 105 

20 323332 AI829520 gb»rh9c06Jc1 NCI.CGAP_Ut1 Homo sapiens S2 

323817 AA410943 BMP-R1B 8.4 

324261 BE06S341 gWm8T0381-27010{HJ73<08 BTD381 Homo 49.4 

324432 AA464510 Hs.152812 ESTs 165 

324598 AW972227 Hs.163986 Homo sapiens cDN*FU22765 fis. done K 5 

25 324603 AW933522 HsJ292934 ESTs 10.4 

324907 A075572 Hs.172634 ESTs 184 

325544 Phase2&3Exons 57 

330388 Hs.46 HBB receptor tyrosine HnaseJfrefMfc 6£ 

334223 NfcUJ0508£T>torr» sapiens 262 

30 335809 )^(r1450^Hoinosapiei6krakeivtop^1 10.1 

335824 ENSPO000Q249Q72*:DJ22^13.1 (N-THWINAL 20 

AI052047 ESTs; WeaWy stmBar to CYTOCHROME P450 6.7 

R72427 CH22_BAAC005500.GENa^AN.127 9 55 

CH22_FGENES519_13 92 

35 CH23JH5B€S.617_9 8 

CH2ZJ6B4ES271J7 65 

CH2fc_FGENES.619_7 73 

CH22J=GEhES271_8 123 

CH22J=GENES.619_12 8.4 

40 CH2^3tAC0^5500.GB«SCAN3042 113 ' 

CK07JB 946004473 85 

CH22J=GENE&617_7 7.1 

CH22JFG&E$£T8_$ 12 

CH2^JGENES.678_5 163 / 



■q 
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TABLE 18A 

Table 18A shows the accession numbers for those pkeys lacking unigenelD's for Table 18. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession*' column. 

Ptey. Unique Eos probeset tdentffier number 

CAT number Gene cluster number 
Accession: Genbank accession numbera 

Pkey CAT number Accession 



323332 179142J AJ828520 AI791832 AA228414 AJ791823 AA22921 1 AA22931 5 

20 322975 1510563J C16391C16413 

324261 273265J BB0K341 AW748403AL044891 AI908240AA393080 

323817 233566J AA410943AW948953AA334202AA332882 

311935 174129J AA216387T63548AA228876 

314138 179960.1 AA740616 AA654854 AA229923 

25 335809 CH2?_3l81FG.617JBJiNK3* 

335824 CH2£_3197RL619J1JJNK^ 

325544 t\2Jv> 

334223 CH2^_15O7FG_36O_4JJNK-0yi 
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TABLE 18B 



Table 18B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 18. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed 

Ptoy: Unique number conBspondkiQ to an Eos probeset 

Ref. Sequence source: The 7 o^iuinbafsblhboolujm 

enffladTheDNA sequence of human chromosome 22/ Dunham LetaL, Nature (1999) 402:489-495. 
Strand: Indicates DNA staid mrowhft \ 

Nt_posffioct Indicates nucleotide posfttons of predicted axons. 



Pkey Ref Strand KLposnlon 

335809 Dunham, LetaL Plus 26310772-26310909 

335824 Dunham, LetaL Ptus 26376860-26376942 

334223 Dunham, LetaL Minus 12734365-12734269 

325544 6682452 Pte 171228-171286 
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TABLE 19: 1045 GENES UP-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT TISSUES 

5 Table 19 shows 1045 genes up-regulated in breast cancer compared to normal adult tissues. 
These were selected from 59680 probesets on the Aflymetrix/Eos-Hu03 GeneChip array such 
that the ratio of "average" breast cancer to "average" normal adult tissues was greater than or 
equal to 2.5. The "average" breast cancer level was set to the 90 th percentile value. The 
"average" normal adult tissue level was set to the 90 th percentile value amongst 144 non- 
10 malignant tissues. In order to remove gene-specific background levels of non-specific 

hybridization, the 15* percentile value amongst the 144 non-malignant tissues was subtracted 
from both the numerator and the denominator before the ratio was evaluated. 



Pfeey: 

ExAccn: 

Unfoene©: 

Un&eneTtte: 

R1: 



15 



20 



25 



30 



35 



40 



45 



422506 
424399 
423575 
50 429441 
431474 



55 



Unique Eos probeset ktortfifermmiber 
ExeinpfarAcoesswnrnirnb^ 
Unjgene number 
Un^enegenetate 

Rafiod tumor to normal body fesue 



Pkey ExAccn UnjgenelD UnigeneTitte 



408591 
406964 
400291 
407277 
449746 
426878 
400292 
427585 
408045 
407178 
407377 
450705 
407212 
428848 
404561 
407980 
447350 
450375 
422109 
435496 
453160 
420813 



427217 
.402578 
422805 
424634 
456207 



459587 



AF015224 

M21305 

M401389 

AW170035 

AI668594 

BE069341 

AA250737 

D31152 

AW138959 

AA195651 

C16391 

U90304 

AA412108 

NMJD0Q230 

AA04S309 

AQ75572 

AA009647 

S73265 

AW840171 

AE63307 

X 51501 

ACSB7700 

AL12Q862 

AI805687 

C18863 

AJ224172 

AL133990 

AB014544 

AA399272 

AA436989 

NMJQ3613 

AA193450 

AJ351010 

AA031956 



Hs.46452 mammagbbhl 



R1 

137.6 
713 



Hs.190721 
H&326736 
Hs.175588 

Hs72472 
Hs.179729 
Hs245123 
Hs.104106 

H&25351 

H&269390 

H&194236 



Hs.172634 

H&8850 

Hs.1473 

H&265398 

Hs239884 

Hs39949 

H&317584 

Hs.124165 

H&2533 

Ks.163443 

H&204086 

Hs.190642 

Hs21572 

H&144341 

Hs.121017 
Hs.151407 



gteHuman alpha sateffie and sateffta 3 
ESTs 68.4 
Homo sapiens breast cancer anfigen NY-BR 542 
ESTs»WeaHysintetoCP4YJft]MANCYroC 46.4 
gWmflT0381-27010(MI73-c08 BFT0381 Homo 443 



BMP-RIB 

collagen, type X, alpha 1 (Schrrdd meteph 

ESTs 

ESTs 

gtxC16391 Ctantech human aorta potyA mRN 

Inoquob homeobox proton 5 

ESTs 

lepth (murine obesity hornotog) 
trUnmSnophatangeal syndrome t(TRPS1) 
gbzH2»1.s1 Soares_^J>earUWi19W 
ESTs 

a d&ntegito and metaJtacrotefoase doma 



ESTs, Weatdjy simBarto transformaSon-r 
H2J hfefeme tamfly. member L 
piotacBn-fnduced protein 
ESTs 

prograrnmed eel deafii 9 (PDCD9) 
aldehyde dehydrogenase 9 famiy, member 
Homo sapiens cDNA FU11576 to, done HE 
ppophflln B (uteroglobin lajn^rnanber) 
ESTs 

WAA0644gene product 
ESTs 

Cirj01134rg^2117372tpinJ65981 fatty ac 
H2A hfetone femQy, member A 
carQage htenrediatB layer protein, nu 
gbczr40e07i1 SoaresJJhfMPu_S1 Homo sari 



Hs.102267 rysyl oxidase 



gtrzkl5e04.s1 Soares j>regnanLuterus_NbH 



37.4 
323 
313 
30.4 
27J 
243 
223 
213 
213 
193 
173 
163 
163 
163 
153 
153 
155 
143 
143 
13.7 
133 
133 
133 
123 
123 
122 
123 
113 
113 
115 
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416208 AW291168 Hs.41295 ESTs, Weakly stmflarto MUC2_HUMAN MUCIN 11.5 



429170 NMJ»1394 Ha2359 dual specftty phosphatase 4 113 

407276 AI951113 Hs326736 Homo sapiens breast cancer antigen NY-8R 11.4 

434377 AW137148 Hs.306593 Homo sapiens cONA FU11382 Ms. done HE 113 

5 448390 AL035414 Hs£106& hypothec protein 11.1 

452401 NMJW7115 H&2S352 tunior necrosis factor, aipha^nduced pro 11.0 

421037 AI684808 Hs.197653 prograrr^ceHdeafli9{PDCD9) 103 

452461 N78223 Hs.108106 banscnpSon factor 10.7 

443348 AW87359S Hs.182278 cahnodufa 2 (phosphorytese foase,deft 103 

10. 421155 H87S79 . Hs.102267 lysyl oxidase 103 

402606 NftUB4626:Hofno sapiens hypothetical prot 10.4 

409269 AA576953 Ha22972 hypo«)e&a)pro1ehFU13352 10.4 

447268 AG70413 H&36563 hypoftefcal protein FU2241B 103 

447033 A1357412 Hs.157601 ESTs 102 

15 400295 W72838 H&2533 a)dehyo^dehyo^nase9fan% 10.1 

424905 NMJD02497 Hs.153704 NtMA(r^tnniitosQO^a)-reiatedk 10.1 

432441 AW292425 Hs.163484 ESTs 93 

427365 AJ873274 Hs.190721 ESTs 93 

438950 H23789 Hs.144530 EST 93 

20 422835 BE218705 Hs.121378 rietafiofobrcHn-filte 5, tesfespecffic 9.7 

425692 D90041 Hs. 155956 W^^tyBransfefase 1 (aryiainineN-acety 9.7 

411869 W20QZ7 Ha23439 ESTs 93 

439820 AL360204 H&283853 HonosapleremRNAMlengftbisertcDN 93 

445730 AI624342 Hs.170042 ESTs 93 

25 459583 A1907673 gb:IL-BT152-08Q3994W BT152 Homo sapfcn 93 

452744 AI267652 Hs3Q504 Homo sapiens rrtfW*cDNADKFZp434E()82(!r 9-2 

432596 AJ224741 Hs278461 matr&i3 9.1 

400297 AH27D76 Hs334473 rrypo^eSca)protBhDKFZp56401278 9.1 

449448 D6O730 Hs37471 ESTs 9.1 

30 423945 AA410943 gb2ff2h03/1 Scares ovary tumorNbHOTH 9.1 

406348 Target Exon 93 

424735 U31875 H&272499 shonVchaki alcohol a^hyorogenase tamily 93 

453392 U23752 Hs32964 SRT(sex*!termir*g region Y>box 11 93 

433365 AF026944 Hs^93797 ESTs 83 

35 405654 NA C12001521s#513934lpirp31081 cca3 pr 83 

415601 AA2794S0 Hs36368 caimegin 83 

451110 A1955040 HsJ265398 ESTs, Weakly sinfe to transformaton-r 8.7 

408771 AW732573 Hs.47584 potasston voBagegated channel, delayed 83 

409041 AB033Q25 Hs30Q81 WAA1199 protein . 8.4 

40 442353 BE379594 Hs.49136 ESTs,hAoa^atelysirniartoAUJ7J1UM^ 83 

451561 N52812 Hs.177403 ESTs 82 

424001 W67383 Hs.137476 patemafly expressed 10 82 

429859 NM.007050 H&225952 pioiemtyfD^p tos ph al ase. re 8.1 

423887 ALD8QHJ7 Hs.134585 DKFZP434G232 protBtn M 

45 405095 NA Target Exon &1 

419296 AA236115 Hs.120785 ESTs 83 

447164 AF026941 Hs.17518 Hornosap^do^nriRNA, part^ sequence 83 

413472 BE242870 Hs.75379 sctute carrier famflyl (gBalhferiaffi 83 

416747 AW876523 Hs.15929 hypofoefca! protein RJ12910 83 

50 415385 R17798 Hs.7535 COBW-tf© protafr) 73 

434424 AI811202 Hs325335 Homo sapiens cDNA:aJ23523 fis. done L 73 

420931 AF044197 Hs.100431 smafl tnduribte cyto)^ 7.9 

406687 M31126 Hs272B20 rr^rretaltoproteinase 11 (MMP11;stro 73 

400285 NA Eos Control 7.7 

55 437207 T27503 Hs.15929 hypoftefcal protein RJ12910 73 

427119 AW8805G2 Hs.114574 ESTs 73 

429534 AW976987 Hs.163327 ESTs, Weak* sirrta to 21092GOABceB 73 

433426 H69125 Hs.133525 ESTs 73 

411078 AI222Q20 Hs.182364 CocoaCrlsp 7.4 

60 426214 H59846 Hs.128355 ESTs,Moo^rater/sfrilartoAUJ7J<UMANA 7.4 

447475 AB80797 Hs. 158992 ESTs 73 

415263 AA946033 Hs.130853 ESTs 72 

439569 AW602166 Hs^22399 CEGP1 protein 72 

414142 AW368397 Hs.150042 rtaroosaplerBcDNARJ14438fed(meHE 7.1 

65 426261 AW242243 Hs.168870 peroxisomal tarrtBsyJated proteh 73 

444783 AKD01468 Hs32180 anflSn (DrosophBa Scraps homotogj, act 63 

445885 AI7340Q9 Hs.127699 WAA1603 protein 63 
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444078 
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425236 
421464 
450736 



452838 
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U65011 
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438167 
433330 
449765 
416276 



X525Q9 

L32137 

R28363 

AW207084 

N92293 

U41060 



Hs2Q2676 
Ks.165028 
Hs.144151 
Hs.167771 
Hs.144479 

Hs.131004 
Hs.155223 
H&2248 
Hs.122579 

Hs.161160 
Hs.130239 
Hs.197075 
Hs.620 
Hs39915 
HS283S76 
Hs.1657 
Hs37849 
Hs.159264 
Hs.7678 
Hs.153687 
Hs228320 
Hs334473 
Hs.10887 
Hs.1 12408 
Hs3895 
Hs33169 
Hs.135568 
Hs.106604 
Hs.152812 
Hs37329 
H&237396 
Hs288467 
H&11713 

Hs.156369 
Hs.77367 

Hs.135191 
Hs331Q2 
H&27U9B 
I&241576 
Hs.72472 
Hs.122147 
H&8Z772 
H&22Q933 
Hs.101774 
Hs30895 
HS214410 
Hs.193696 
Hs25144 
H3.102S0 
Hs38471 
Hs26770 
Hs.155223 
Hs.190086 

Hs.12388 
Hs30743 
Hs.12696 
Hs.161640 
Hs.1584 
Hs24286 
Hs.132816 
H&206832 
Hs.79136 



synaptonemal complex protein 2 6.9 

ESTs 6.9 

ESTs 63 

ESTs 63 

ESTs 08 
NM^021058*i^sap^H2Bhistonefami 63 

ESTs, WeakVsfm8af to T1 7227 hypoftefl 67 

stannbcakan2 63 

small bducftde cytokine subfamily B(CX 63 

hypo»egra>protehaj10461 63 
gbcCM1-ST0277-061299<J594)07 STB277 Homo 63 

ESTs 63 

ESTs 63 

ESTs 63 

buDous pernprrigold antigen 1 (23Q/240kD) 63 

androgen ittfepktf (dihydvolestosterone r 63 

ESTs, WeaWy sknfarfo unknown protein 63 

estrogen receptor 1 63 

ESTs 63 

Human done 23948 mRNA sequence 6.4 

oefiularrefinotoacid-birKfing protein 1 6.4 

tnosrtoi polyphosphates-phosphatase, ty 6.4 

hypcflie6caJr^rjfefoFU23537 6.4 

hypcffiefcaJ protein DKFZp56401Z78 62 

sirnSarto lysosofne-assodated membrane 6.1 

S100catBUn^«^protehA7{pso^ 6.1 

ESTs 6.1 

matrix ntetaflopmfehase 1 (MMP1; inters 63 

ESTs 63 

ESTs 6.0 

ESTs 53 

HSPC072proteh 53 

ESTs 53 

Homo sapiens cDNARJ12280fis, done MA 53 

E74-fite (actor 5 (eis domafr) transcript 53 
gb^4M)073O9030n-05OO03 UM0079Homo53 

hypoffteGcaJ protein slnttbr to tenasdn 53 

monokine induced by 9anvna lutralrtui 53 

gbcyg06h01j1 Scares Mailt brain 1MB H 53 

ESTs, Weakly siroBar Id imnamed protein 53 

transcription factor AP-2 beta (acSvafi 53 

hypo&fefca) protein FU23293 siniBar to 5.7 

hypriheftk^ptDtefrPR02577 5.7 

BMP-RIB 53 

ESTs 53 

cofegen, type XI, alpha 1 53 

ESTs 53 

hypc^e&al protein aJ23045 53 

horneoboxC4 5.4 
ESTs, Weakly similar to MUC2J1UMAN MUCW 5.4 

ESTs 5.4 

ESTs 5.4 

(^srtRKP-«pec&4OkDapr0teh(RRp8- 5.4 

ESTs, Weakly stmflar to T18712 hypotefi 5.4 

fatty add binding protein 7, txah 54 

stanntocakSn 2 53 

ESTs 53 
gb£ST382140 MAGE resequences. MAGK Homo53 



ESTs 53 

prufwwrfldfly expressed antigen h meta 53 

con^dmSH3ctamairvto^ 53 

tyrosine amtartransferase 53 

caifflageolfc^rnericmat^ 5J2 

ESTs , 5.2 

hypoft^caJ protein MGC14801 52 

ESTs.MooWelysinT8artoAU«_HUMANA 5.2 

UV-1 protein, esbogen regulated 52 
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400300 X03363 HER2 receptor tyrosine Hrase(c-erW)2, 5.2 

418004 U37519 H&87539 alofehyte ctehydioc^^ 5.2 

439840 AW449211 Hs.105445 GDNF fam9y receptor alpha 1 52 

428771 AB028992 Hs.193143 WAA1069 protein 5.2 
5 455047 AW852530 gb-PM1-CT0243^J7109W)01^08 CT0243 Homo5^ 

419169 AW851980 Hs262346 ESTs, WeaWy sintotoS724^hypoftefi 52 

453197 AI916269 Hs.109057 ESTs, Weakly sWIar to ALU5_HUMAN ALU S 5.1 

40088 AA032279 Hs51635 ^transmembfaroep&^anfigeno^ 5.1 

431023 AEB3133 Hs^97420 ESTs 5.1 

10 427668 AI791495 . Hs.180142 calmoduDrvte stin proteh 5.1 

427718 AI798680 K&25933 ESTs 5.1 

434531 AA542007 Hs.1 16369 ESTs 5.1 

429220 AW207206 Hs.136319 ESTs 5.1 

405494 NA C^837^gJl12697903|a^|BAB21770.1|(A 5.1 

15 452930 AW1 95285 Ks.194097 ESTs v Weakfys^tol38022rqrpotfie8 5.1 

444910 AI2D1849 gfcqs76g04.x1 NCLCGAPJ>r28 Homo sapiens 5.1 

453310 X70697 Hs553 solute canter famBy 6 (reuolr ansrnR te 5.0 

444381 BE387335 Hs383713 ESTs, WeaWysmfiartoS64054hypoffietj 5.0 

450603 R43646 Hs.12422 ESTs 5.0 

20 416575 VWE414 Hs38383 ESTs 5.0 

438504 AW665281 Hs*224625 ESTs 5.0 

416209 AA236776 Hs.79078 MAD2(m^airest ardent yeast, h 5.0 

428804 AKD00713 Hs.193736 hypc«ie1icalpn>tehFU20706 5D 

420077 AW512260 Hs57767 ESTs 4.9 

25 450480 XB2125 K&25040 ztnc finger prototo 239 4.9 

437637 AJ003029 Hs.65792 syntrophi%gamma2 4.9 

431608 M30703 H&270833 amphheguBn (schwaniwrn^derfved growBi 4.9 

418836 AI655499 Hs.161712 ESTs 4.8 

442441 A1820662 Hs.129598 ESTs 45 

30 435635 AF220050 Hs.181385 urniiaiadBn^hematopcleficstenV^e 4.8 

400286 NA C160009223lj74991(Blpfr{p?09Q3 bypothe 4.8 

407506 U71600 gb^umanzirrcfcgerpratozfpSI (rf31 4.8 

420026 A1831190 Hs.166676 ESTs 4.8 

441377 BE218239 Hs3Q265B ESTs 4.8 

35 457726 A1217477 Hs.194591 ESTs 43 

412785 AW997556 Ks.78521 WAA1717 protein 45 

428366 BE440042 Hs.83326 n^metaltopi0tetoe3(stnmw^ 4.7 

436026 AJ349754 K&217081 ESTs 4.7 

409110 AA191493 Hs.48778 ntban protein 4.7 

40 400284 NA estrogen receptor 1 4.7 

410102 AW248508 Hs£79727 Hon» sapiens cDNARJ14035fis, clone HE 4.7 

407819 R42185 Hs7748Q3 ESTs 4.7 

430486 BBQ62109 Hs>241551 chloride channel calcium ac&vated. fem 4J 

422896 AW961489 Hs.154116 ESTs 4.7 

45 453616 NMJB3462 Hs33846 c^iefe, atoned |grd intend 4.7 

427427 AF077345 Hs.177936 ESTs 4.6 

421751 AW813731 Hs.159153 ESTs»ModerateJysimiark)S65657 alpha 4.6 

454074 R63503 Hs^8419 ESTs 4.6 

405718 C4000799'Si|6330365|eTpAA 4.6 

50 444649 AW207523 Hs.197628 ESTs 4.6 

429431 Z40313 Hs.106330 Homo sapiens done IMAGE23371. mRNA seq 4.6 

427811 M81057 Hs.180884 cartoiypepfidaseBI (fcsue) 45 

447342 AI199E268 Hs.19322 Homo sapiens, Slnta to RIKEN cDNA 2010 4.6 

430345 AK0OQ282 Hs^39681 bypotr^protehRJ2Q275 4.6 

55 454307 AW855717 g>RC1-CTQ27946129W)1^b01 CT0279 Homo4.6 

400303 AA24Z758 Hs.79136 UV-1 protein, estrogen regutated 4.6 

438180 AA808189 H&272151 ESTs 4.6 
451340 AW936273 QbtMUT1XmW02QM07-qP7 DT0Q20 Hon»4.6 

458711 ALD36877 H&282678 ESTs 45 

60 457430 AA514660 Hs.128443 ESTs 4.6 

416030 H15261 Hs£1948 ESTs 43 

447233 AW246333 Hs.17901 Homo sapfens, done IMAGE3937015, mRNA, 43 

445537 AJ245671 Hs.12844 EGRfcentomaln. mufiipte 6 (E6R6) 45 

424590 AW966399 Hs.46821 hypothetical prate* FU20086 45 

65 432374 W68815 Hs301885 Horo sapiens cDNAFU11346Bs, done PL 45 

423833 AW503329 gbdJU*F*N0-ata-^-(WLr1 NIH_MGC_50 45 

406747 AI925153 Hs217493 aimexfnA2 45 
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431716 
411050 
401418 
5 436194 
436211 
414080 
424115 
415786 

10 442117 
45Z784 
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410534 
405196 

15 430217 
401793 
415747 
423679 
400238 

20 425627 
400608 
458634 
407771 
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445263 
407162 
419536 
30 454359 
411556 
450715 
421451 



35 409757 
413043 
413499 
444819 
408380 

40 406992 
404285 
425247 
428046 
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45 421147 
426451 
415227 
452176 
452862 

50 443646 
425523 
424687 
430009 



H56435 
D89053 
AW8149Q2 
NA 

AKD01074 

AKD01581 

AA135257 

AA335497 

AW419196 

AW664964. 

BE463857 

R31178 

AW905138 

NA 

N47863 
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AB007975 
NA 
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NA 
NA 

BE247684 
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N71277 
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AI266484 

AA291377 

AA033714 

NML001898 

BE158766 

BE144884 



gfcyo£8e09Jl Soaresfetal fiver spleen 4.5 

H&268012 fat^dd^oen^AQgase^tof^^ 4.5 
gb*flR1-ST0206-120400^22-TO8 ST0206 Homo 4.5 

C14000338*^745^02fc*BS74665 outer 43 

Homo sapiens cDNAFU10212fi3» done HE 4.5 

hypoftefcal protern FU10719; WAA1794 4.4 

Baggressh^tymphoiTBgene 4.4 

ESTs. Weak*/ sMtarto 138022 rrypothefi 4.4 

hypcfrefcal protein FU13782 4.4 

ESTs 4.4 

typotietka) protein FU21062 4.4 

fibronecfinl 4.4 
gb^MMW10714»0400-207-g07 NN1071 Homo4.4 



Hs333435 

H&334828 

Hs.47783 

H&2939G5 

H&257924 

Hs.128899 

Hs.151258 

H$^87820 



55 451381 
450229 
455700 
431924 
438885 
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431676 
409092 
429270 
443903 
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432912 



AF123050 

S82472 

NA 

ML0O594O 

AW812795 

AA026880 

AW592167 
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AW821113 

AA024538 
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AW85198 

AB007948 

JQ5070 
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AA634806 
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R1B717 

BB068115 

AKD00850 



Hs336901 

Hs.131454 

Hs^97M7 

H&282898 
Hs.62713 



Hs.103070 

Hs.42586 

Hs.142634 



Hs.70725 

K&31570 

H&50831 

Hs^87629 

Hs.123114 



H&8172 
Hs.44532 



A1685464 

AI735283 

WG0379 

A1220547 

AW057736 

U85658 

BB007371 



(2000662^512792tpirpi2482 hypoihe 4.4 

rbosornal protein S24 4.4 

C17001545^5360127|gb|AAD42882.1|AF155 4.4 

gb£ST94257 AcBvated T-ceBs I Homo sap 4.4 

KIAAD50S protein 4.4 

C19000274*^)12741327J^XP_0088312J 4.4 

rrembrane-bound to a&cn Vl lon factor prot 4.4 

C10001899^508633$r|IT25392 hypothe 4.4 

ESTs 43 

ESTs 4.3 

TaigetExon 43 

Target Exon 43 

ESTs 43 

WAA1560 protein 43 

zinc linger protein 43 

gfcnp12d11.$1 NC1^CGAP_Pr3 Homo sapiens 43 

gteza36e03*1 Soares fetal fiver spleen 43 

garnma^mtotertyte 42 

ESTs, WeaHy sfrnSar to KIAA1324 protein 42 

ESTs 42 

hypoftefical proton FLM4260 42 

cystafinSN 42 

gb:lL2-HTO397-O7129WC44 : 02 HTD397 Homo 4.2 
gbcCMfrHT01B2<>4109W)6&e11 HT0182 Homo42 

ESTs, Mcx)eiatelysirn8artoA46010X«i 42 

dtubtqinon 42 

gb±etei»H)NA polymerase befe{exon a 42 

C6001909gp>4441|Aptt189Q9.1I (D298 42 

Hs.155324 ma^meteSoprotehasell (MMP11;sfro 42 

Hs.155381 ESTs,Modefl^smnIarto)38Q22hypot 42 

H&25252 prolactin receptor 42 

H&293299 ESTs 42 

Hs.169946 GATA4AKfing protein 3 (T-cel receptor 42 

HsJ2402 ESTs 42 

H&282990 Human DMA sequence lh)m done RP1-28H» 42 

Hs3687 ESTs 42 

Hs.164226 ESTs 42 

Hs.158244 K1AA0479 protein 4.1 

Hs.151738 maM(roelalk)pretBhase9(p^^ 4.1 

1&22242 ESTs 4.1 

gfcab28cQ2jr1 Stefegene bmg (937210) H 4.1 

H&172330 hypoOteo^ proton MGC2705 4.1 

H&8329 hypoO)6Q^ protein FU11362 4.1 
gb£W-BTQ36W)6129W)60g07 0TD368 Homo4.1 

H&Z72203 HonosaplenscDNAaJ20343^cloneAD 4.1 

Hs.184987 ESTs 4.1 

Nhl_004496^Hort»sapiera 4.1 

gfctt88»4 XI NCLCGAP_Pr28 Homo sapiens 4.1 

Hs.172608 ESTs 4.1 

H&57773 ESTs 4.1 

Hs.135223 ESTs - 4.1 

HS323910 HER2ieceptortyrostrml(toase(o«i>^ 4.1 

Hstfl796 tjansatptotactor/^-2ganmra(ac^ 4.1 

H&20O313 ESTs 4.1 
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403585 TaigetExon 4.1 

438295 AJ394151 Ha37932 ESTs 4.1 

420380 AA64Q891 Hs.102406 ESTs 4.1 

431118 BE264901 HsJ2505Q2 cartx^fcanhydiaseVllI 4.1 

5 416162 NML004354 Hs.79069 cycDnG2 4.1 

418994 AA296520 Hs39546 sdec8nE(«Kk)8iefeJadhestonnwtecu^ 4.1 

400555 Target Exon 4.1 

410079 U94362 Hs58589 pAcogenfo2 43 

427674 NML003528 H&2178 H2Bhtstonef3m9/ ) member Q 43 

10 427131 AM48460. Hs.112017 GE36gene 4.0 

439759 AL359055 Hsj67703 Homo sapiens mRNAtiil length tnsertcDN 4.0 

429353 AL1174C8 H&20OW2 A7P4)iraftTgcasseaetranspOftef MRP8 43 

421296 NMJQ2666 Hs.103253 perffiptn 43 

418819 AA228776 Hs.191721 ESTs 43 

15 424188 AW954552 Hs.142634 zhc finger protein 43 
455431 AW938484 gb^MMyT00S7-59020O-25M06 DT0O57 Homo43 

404142 NA Target Exon 43 

441143 A1Q27604 Hs.159650 ESTs 43 

444540 AIG93927 H&2651G5 ESTs 4i) 

20 415579 AA165232 Hsl222Q69 ESTs 4.0 

452891 N75582 H&212875 ESTs, Weak)/ similar to DYH9JUJMAN CUA 43 
414606 BE390440 gfc601283601F1 N&LMGCL44 Hot© sapiens c 4.0 

452281 T93500 H&28792 Homo sapiens cDNA RJ11041 fc, done PL 4.0 

417801 AM17383 Hsl82582 tntegrtn, bete-fike 1 (with EGF-Bce rep 43 

25 446232 A1281848 Hs.194691 refinoicackt Induced 3 4.0 

447377 X77343 H&334334 fcanscrfp&n factor AP-2 alpha (acfivat 4j0 

437854 AL119723 gbDWFZp761A2124_r1 761 (synonym: hamy2) 43 

446140 AA356170 H&26750 hypoftefcal protefc FU21908 4.0 

452240 A1591147 Hsj61232 ESTs 4.0 

30 459574 AI741122 Hs.101810 Homo sapiens cDNAFU14232 ketone NT 4.0 

458673 N99626 gbaa39oi 1x1 Scares Hal Bwr spteen 4.0 

444858 A1199738 H&208275 ESTs, WeaWy simiar to ALUAjflJMAN Illl 4.0 

452166 A1948607 H&264680 ESTs 4.0 

452681 AF153330 H&3Q246 solute carrier tamfly 19 (thiamfoe Irans 33 

35 450192 AA263143 H&24596 RAD51-feitBiacBng protein 33 

406554 NA TargetBton 33 

416259 AA573006 Hs.19173 ESTs 33 

445813 Z42023 Hs.106576 alatfne-glycixytete arnmotiansferase2.fi 3d 

451024 AA442176 gbcav63b08 x\ SoaiesJotaUetus JflJ2HF8_ 33 

40 413930 M86153 Hs.75618 RAB11A. member RAS oncogene famSy 33 

401781 TaraetExon 33 

415296 FO5086 H&328142 ESTs 33 

452564 AA026777 gb2E93c11 jl Soares_{efaUieait.NbHH19W 33 

442500 AB19068 H&209122 ESTs 33 

45 419759 221336 Hs.135411 acfin related proteh 33 

424638 A1472106 Hs.49303 Homo sapiens cDNARJ11663 8* clone HE 33 

439689 AF086534 Hs.187561 ESTs. Moderated sMto to ALUIJttJMAN A 33 

428042 AA419529 K&76391 inyxovlrus fmfluenza) resistance 1, homo 19 

452501 AB037791 rfe29716 hypoftefical proteh RJ10980 33 

50 453049 BE537217 Hjl3Q343 ESTs 3.9 

443213 BB68414 Hs.145497 Homo sapiens eONA:RJB2D97§s, clone H 33 

443489 AI073512 Hs.133916 ESTs 33 
455092 BE152428 gtKCMWfTD323-151299-126-b04 HT0323 Homo33 

401785 NKL002275*itonx> sapiens keratin 15 (KRT1 33 

55 426427 M86699 Hs.169840 TTK protein Wnase 33 

446009 AJ989885 K&231926 ESTs 33 

436033 H75391 Ha255748 ESTs 33 
451067 BE172186 gbMWm)55^11030(MI05^11 KTQ559Homo33 

419348 AA236645 Hs38274 ESTs 33 

60 444635 AM8426B H&3396G5 ESTs 33 

412140 AA219691 Hs.73625 RAB8 interacting, WnesMke (rabkines 33 

403593 NA TaigetExon 33 

442323 AW016669 H&29190 ESTs 3.8 

419854 AW664873 Hs37836 Homo sapiens PACctoneRP5-1087M19 from 33 

65 433871 W02410 H&205555 ESTs 3.8 

445253 A1217928 H&144762 ESTs 3.8 

409542 AA503020 H&36563 hypofretfcal protein FU22418 3.8 
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443162 T49951 Hs.9029 DKFZP434GQ32 protein 33 

458134 AW383618 Hs_S5459 ESTs, Modefa^sirnttartoAUi?JiUMAN A 33 

422475 AL359938 Hs.1 17313 Mels (mouse) homolog 3 33 

440705 AA904244 Hs.153205 ESTs 33 

5 447290 AJ476732 H&263912 ESTs 33 

403426 Target Exon 33 

427821 AM70158 Hx98202 ESTs 33 

454288 BE222648 H&279458 ESTs. Highly similar to c380M.1b [H^ap 33 

443801 AW206942 HsJ253584 ESTs 33 

10. 410858 AW105231 Hs.192035 ESTs 33 

410672 AW794600 gb«C&UM0014-17030OO22-C05 UM0014 Homo33 

428579 NM.005756 Hs.184942 6 prot^KOUpled receptor 64 33 

445495 BB22641 Hs38489 ESTs, WeaMy sfmterto (38022 hypothett 33 

447995 AI742618 Hs.181733 ESTs, WeaMy sbnfer to nflrOase hornoio 3.7 

15 401747 Homo sapiens terath 17 (KRT17) 3.7 

420633 NMJM4581 Ha_74480 (xkwartttnding protein 2A 3.7 

423545 AP000692 Hs.129781 chronn6one21op8niea(^frame5 3.7 

' 433138 ABQ29496 Hs39729 semaphormsentf 3.7 

434715 BE005346 Hs.116410 ESTs 3.7 

20 428664 AKD01666 Hs.189095 aitotoSALLI (sal(Drosoph3aHik8 3.7 

450951 AA018534 Hs.103334 ESTs 3.7 

402696 NA C30Q2523^t[668&211tsp|QZ7533(rH2M-CAEa3.7 

446868 AV660737 Ks.135100 ESTs 3.7 

458154 AW816379 H&335018 ESTs 3.7 

25 422026 U80736 Hs.110826 frinuJet^ repeal coniaWng 9 3.7 

419440 ABQ20689 Hs30419 WAA0882 protein 3.7 

421524 AA312082 Hs.105445 GDNF family receptor alpha 1 3.7 

417283 N62840 Hs.48648 ESTs 3.7 

401508 HA NKL024817*tano sapiens hypofrstfcal prot 3.7 

30 410303 AA324597 H&21851 Homo sapiens cDNA FU12900fis, clone NT 3.7 

420362 U79734 Hs37206 hunfin#i trferacfing protein 1 3.7 

433384 AIQ21992 Hs.124244 ESTs 3.7 

434302 AA629065 Hs.116301 ESTs 3.7 

443938 R55373 H&20864 ESTs 3.7 

35 448420 BE623004 gfc601441282F1 WH_MGC_72 Homo saptens c 3.7 

458712 AI3475Q2 Hs.107872 hypoftetoi protein RJ20761 3.7 

433404 T32982 Hs.102720 ESTs 3.7 

405232 NMJ>15B32flara> sapiens iriefliyl^ biixfin 3.7 

430491 AL109791 Hs_M1559 HanosapierBmRNA^tengft Insert cDN 3.7 

40 455609 BE011668 gMmBN0223-1C050O-177-aO4 BN0223 Homo37 

450164 A1239923 Hs30098 ESTs 3.7 

453948 A1970797 Hs34859 ESTs 3.7 

436061 AI248584 Hs.190745 Homo sapiens cDNA:FU21326 fis. clone C 37 

401049 NA TaigetExon 33 

45 418867 031771 Hs39404 rnsh(Drosoph^rwr^ 33 

420179 N74530 Hs21168 ESTs 33 

456663 AV658444 H&280776 tanfeyrase. W1-Meracfrig ankyrtrweb 33 

437259 AI377755 Hs.120695 ESTs 33 

428309 M97815 Hs.183650 (xton^cado^binaHig proto!n2 33 

50 450522 AI698839 grxwcB1fQ2jrt SoaiesJ4FL.T_6BC.S1 Homo $33 

451952 AL120173 H&301663 ESTs 33 

412209 AW901456 gJbJ^Cf>W<1012-27rj300-031-c07 NN1012 Homo3.6 

425201 AA352111 gh£ST60061 AcfivatedT-ceflsXXHomosa 33 

443830 AI142095 Hs.143273 ESTs 33 

55 439255 BE164500 g>ilC4-HT0469^30(W)14^10 HT0469 Homo 3.6 

414869 AA157291 Hs21479 ubhuctetol 33 

409064 AA062954 Hs.141683 ESTs 33 

407721 Y12735 Hs.38018 ctek*ed&tyfrrtt*^^ 33 

445135 AK000054 Hs.12347 rrypttfieflcal protein FU20047 3.6 

60 404091 NA TaigetExon 33 

409731 AA125985 H&56145 fliymosin, bete. Hentffied In neuroblast 3.6 

405153 Target Exon 33 

423248 AA380177 Hs.125845 rfcuk»e^)hosphat0~^^ 33 

403639 NA ENSP0C000233023*tCDNA FU12662 fis, don 3.6 

65 404360 C70»385^«120a2809|gb|AAG486iai|AF315 33 

422352 AA766296 Hs.99200 ESTs 3.6 

423338 ABO07961 Hs.127338 WAA0492 protein 33 
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424202 


BE350295 


Hs.15032 


RAN btndbg proteh 17 


3.6 


431750 


AA514986 


Hs.283705 


ESTs 


3.6 


439907 


AA853978 


Hs.124577 


ESTs 


3.6 


453596 


AA441838 


Hs.62905 


hypdheM proteh FU14834 


33 


406446 


NA 




Target Exon 


16 


418454 


AA315308 


Hs.195870 


hypomefical proteh FU14991 


16 


434360 


AW015415 


Hs.127780 


ESTs 


33 


409079 


W87707 


Hs.82065 


hferteukto 6 signal transducer <gp130, 


3.6 


440132 


A1697121 


H$^02466 


ESTs, Weakly sirnferto S65824 reverse t 


3.6 


448706 


AW291095 


Hs21814 


htarletiltin 20 receptor, alpha 


3.6 


440671 


AW297920 


Ns.130054 


ESTs 


33 


407647 


AW860158 




gbftCCUXQ379£9010(HK£b04 CTO379 Homo33 


458023 


AW968226 


Hs30798 


ESTs 


15 


4Q2B20 






NMJH7646^Homo sapiens fRNA isopentenyl 


33 


417009 


AA191719 


Hs314714 


ESTs 


33 


447048 


AW3930B0 


H&228320 


hypofcetxal proteh FLE3537 


33 


449978 


A1806335 


H&200829 


ESTs, Weakly strrto to T30171 nheh - 


33 


428062 


AA420683 


Hs£8321 


hypothetical protein FU14103 


33 


452909 


KM_015368 


Hs30985 


pannexlnl 


33 


400610 


NA 




Target Exon 


33 


417843 


W07361 




Homo sapiens cDNA HJ12935 fis, done NT 


33 


419335 


AW960146 


Hs^84137 


hypothetical proteh RJ12B88 


33 


451592 


AI805416 


Hs£13897 


ESTs 


15 


443270 


NM-004272 


HSu337737 


Homer, neuronal immediate early gene, IB 


33 


423948 


AW392342 


Hs^83077 


centrosoma) P4.1^associated raoteh; unc 


15 


449424 


AW44B937 

flf 1 1 fWVl 


Hs.197030 


ESTs 


33 


423841 


AW753967 




gtKRC2^T0304^010OO11-h12 CT0304 Homo 33 


416806 


NM 000288 


Hs.79993 


peroxisomal bfogenesfe factor 7 


33 


422060 


R20893 


Hs.325623 


ESTs, Moo^rateVs^toALUSJftJMANA 


33 


433104 


AL043002 


Hs.128246 


ESTs, Moderately similar to unnamed prot 


33 


415778 


H84847 


Hs.49391 


riypofoefcal protein LOC54149 


33 


413054 


AW316843 


Hs.66309 


hypofteflcal protein MGC11061 ^ 


33 


416638 


N32>36 


Hs.42645 


sohilB carrier famfly 16 (rnonocartxayfc 


33 


424639 


AB17494 


H&9812 


Homo sapiens cONA HJ143B8 fis, clone HE 


33 


424827 


AI057D94 


Hs.96867 


Homo sapiens cDNA: FU23155 fis, done L 


33 


437782 


A1370876 


H&79090 


exporbn 1 (CRM1,yeast homotog) 


33 


411514 


AW850178 




g4rfL3^T0219^710^Q22-Hl2 CTO219 Homo 


33 


413783 


AA314337 


Hs301547 


rfbosoma) proteh S7 


15 


421106 


AA877124 


Hs.172844 


ESTs 


33 


431291 


N25521 


HsJ*5275 


KnjppeM)^ zinc fino^ proteh 


33 


440623 


AB35016 


Hs316639 


ESTs 


33 


455838 


BE145808 




gbJimHT0208-101299-103^11 HTO208 Homo 33 


458771 


AW295151 


Hs.163612 


ESTs 


15 


442942 


AW167087 


Hs.131562 


ESTs 


15 


436550 


Z50158 


HS27QZ35 


ESTs, WeaMystrnSarto MMHUB1 tamhh b 


15 


418849 


AW474547 


Hs33565 


Hon)OsaptorKPK^mfWformanrosy&^ 


15 


424430 


BB14743 


Hs.146688 


prostagtandh E synthase 


33 


430916 


AW505G21 


H&68414 


BTB and CNC homology 1, bash leucine zl 


15 


432030 


AB08400 


Hs.143789 


ESTs 


15 


439405 


AF086224 


Hs35238 


ESTs 


33 


405917 


NA 




C17000675.-gJJ7M0703lBblAAF46150.1l (AE0 


33 


452727 


AW993582 


Hs.176220 


ESTs 


33 


426320 


W47595 


Hs.168300 


bansfbiiuiiiy growth factor, beta 2 


14 


421070 


AA283185 


Hs.19327 


ESTs 


14 


424625 


AW904466 


HS321197 


PDZ (tomato proteh (Otosoptfa haWte 


14 


428508 


BE252383 


Hs.184668 


S8B131 proteh 


14 


455651 


BE0649B2 




gW*C1-BT0313-13040OO1&<fl2 BT0313 Homo 3.4 


410555 


U92649 


Hs.64311 


a dkhtegrte and rnetaloproteinase doma 


14 


447754 


AW073310 


Hs.163533 


Homo sapiens cDNA RJ14142 fis, clone MA 


14 


418636 


AW749855 




gb^4-BT0534-28l29&053*05 BT0534 Homo 14 


404097 


NA 




C5000242^II9369379|^HAAF87128.1}AC008 


14 


434205 


AF119861 


H&283032 


hypofteBcal proteh PRO2015 


14 


421072 


AT215069 


H*39113 


ESTs 


14 


402421 


NA 




C10O1578^6759^{gb|AAF2B099.1| (AF1 


14 


405248 


NA 




Target Exon 


3.4 


407638 


AJ404672 


Hs334483 


hypoftefcal proteh FU23571 


It 


403000 


BE247275 


Hs.151787 


U5 snRNP-spedfic proteh, 116 kD 


3.4 
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433393 AF038564 Hs.98074 ftchy (mouse homotog) E3 ublqurtin prote 3.4 

432239 X81334 HsJZ936 matrix fTietadoprotefriase 13 (ajflagenase 3.4 

458747 BE618395 Hs^7391 hypothetical pmteh DKFZp761J1523 3.4 

442082 R41823 Hs.7413 ESTs; catsyntenM 3.4 
5 417S74 AA210765 gbczr90c06j1 NCLCGAPJ3CB1 Homo sapiens 3.4 

446002 AB46468 Hs.145789 ESTs 3.4 

4489% M613276 H&5662 guaiitenucfeo8<tebfa^piDtefo{Gpf 3.4 

436007 AB47716 Hs332168 ESTs 3.4 

424698 AA164366 Hs.151973 hypotttefica! protein RJ23511 3.4 

10 435202 A1971313 . Hs.170204 KIAA0551 protein 3.4 

410467 AF1 02546 Hs.63931 dachshund (ftosophBa) homotog 33 

405460 NA TaigetExon 33 

441826 AW5Q36Q3 Hs.129915 phosphobfesteiase related 3.3 

453472 AL037925 o££KFZp564M037_r1 564 (synonym: hft*2) 3.3 

15 447078 AW885727 Hs301570 ESTs 33 

441690 R81733 Hs33106 ESTs 33 

420092 AA814043 Hs.88045 ESTs 3.3 

418478 U38945 Hs.1174 cycHrvdependent kinase fohWtor 2A(me 33 

408908 BE296227 Hs^50822 sefro/Bireorine kinase 15 33 

20 414737 AI160386 Hs.125087 ESTs 33 

449650 AF055575 H&23838 cafctom channel, voltage-dependent, L ty . 33 

418912 ML000685 Hs*9472 angiotensin receptor 1 33 

4&405 AA1 60079 Hs.172932 Homo sapiens mRNA for partial 3TTTR, seq 33 

453911 AW503857 Hs.4007 Saicotefnma^ssodated protein 33 

25 409361 NM.005B82 Hs3441B sine ocufis homeobox (DrosophBa) hamoto 33 

429548 AW138872 Hs.135288 ESTs 33 

420807 AA280627 Hs57848 ESTs 33 
4098% AA296981 gb£ST11K14 Adrenal gland tumor Homo sa 33 

445189 AI336450 Hs.147482 ESTs 33 

30 402892 NA TaigetExon 33 

426681 AA994896 H&22514 ESTs 33 

458722 AA741545 H&282832 ESTs, WeaWy simBar to T24961 hypolheti 33 

409430 R21945 Hs.166975 spiking factor, araWne/sertne-rich 5 3.3 

443194 A1954968 rte579009 matrix Gfai protem 33 

35 445432 AV653771 gfcAV653771 GLC Homo sapiens cDNA done 33 

410908 AA121686 Ks.10592 ESTs 33 

406151 NA TaigetExon 33 

436461 AW511956 Hs293261 ESTs 33 
411171 AW82Q260 $KQV2-ST0296-15G20(H)4f>c10 ST0^6 Homo 33 

40 432415 T16971 H&289014 ESTs>WeaMysiriflartoA43932iriucw2p 33 

439310 AF086120 Hs.102793 ESTs 33 

401575 NA TaigetExon 33 

420900 ALD45633 Hs.44269 ESTs 33 

445628 AB44166 Hs.155743 ESTs 33 

45 448243 AW369771 H&52S20 Wegrih. beta 8 33 

445102 AW2O4610 HsJ22270 ESTs 33 

46118 AA976718 HsJB2242 ESTs 3.3 

416783 AA206188 Hs79889 rnorMxytetonracrophagedrifefBJrti^^ 33 

435039 AW043921 Hs.130526 ESTs 33 

50 451474 T70874 Hs707636 ESTs 3.2 

44255Q T10213 Hs.159993 gycosyfoansferase 3^ 

453921 AI824009 Hs.44577 ESTs 32 

420036 R60336 Hs52792 Homo sapiens mRNA;cDNADKF2^586l1823{f 35 

435627 W88774 Hs.118370 ESTs 3.2 

55 411598 BE336654 Hs.70837 H3Mstonetamfly ( member A 3.2 

446733 AA863360 H&26040 ESTs, WeatyslriiBar to ^acH omega 32 

410153 BE311926 Hs.15830 hypoftetica) protein RJ12691 32 

403637 NA C3001106^0047201|dbpAB13394.1I 32 

405547 NMJH8833**tonra sapiens transporter 2, A 32 

60 427878 O057C6 Hs.181022 CGW7 protein 32 

451871 A1821005 Hs.118599 ESTs 32 

410313 R10305 Hs.185683 ESTs 32 

416856 NZ7833 Hsl269Q28 ESTs, WeaJdysimQarto J38022b)^potheii 32 

449490 AI652777 Hs.197069 ESTs 32 

65 450506 NML004460 Hs.418 firxoWastac8vafionproteh. arpha 32 

440684 AG53123 Hs.127356 ESTs, Highly stmflar to S21424 nestJn [H 32 

459055 N23235 Hs30567 ESTs,WeaWys{rnIlartoB34087hypothefi 32 
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452190 H26735 
430985 AA489732 
405394 

424683 BE169810 
454265 HQ3556 
437687 AA765917 
428372 AKD00684 
414083 AL121282 
411670 AWB56552 
416283 NM.0Q5429 
437488 AA758239 
428398 A1249368 
452042 H38857 
421477 AJ904743 
438078 AKM6377 
448816 AB033052 
419519 AI198719 
404580 

447046 AA326187 
457473 AW974903 
429838 AW904907 
459702 AJ204995 
400195 HA 
417860 AW408557 
417995 AW974175 
422589 AA312735 
435870 AA701327 
440601 AA9063G6 
426274 D38122 
423728 AW891294 
439677 R82331 
452834 AI638627 
431349 AA503653 
417576 AA339449 
430264 AA470519 
418827 BE327311 
410835 AW806906 
426289 H153Q2 
405336 MA 
437783 AI683150 
440931 AI583052 
455945 BE160636 
430437 AI768801 
405848 NA 
455685 BE066976 
406970 M29994 
409602 W26713 
423518 D45QZ7 
425653 AJ065104 
426326 BE165753 
433805 AA706910 
437152 AL050027 
448602 A1541305 
452844 AW407181 
407366 AF026942 
408254 AW807227 
424085 NMJJ02914 
416790 R33066 
420020 BE295866 
426119 W94997 
426968 U07616 
457421 AL117431 
453403 BE466639 
454141 AW138413 
426650 AA382814 
450865 AI248013 
407993 AW1 35274 



H&91668 Homo sgpJens done PP1498 unknown mRNA 12 
Hs.154918 ESTs 12 

Target Exon 12 
Hs.47557 ESTs 12 
H&300949 ESTs, WeaWyshtor to thyroW hormone 12 
Hs.122840 ESTs 12 
H&183887 hypothetical protein RJ22104 12 
H&257786 ESTs 12 

gtKRC1-CT0294-0801OM)12^a04 CTQ294 Homo 12 
Hs.79141 vascular endotheBa) growth factor C 12 
Hs.180330 ESTs 12 
Hs.98558 ESTs 32 
Hs>243901 Homo sapiens cDNA FU20738 6s, done HH 3.2 
Ks.104650 hypcfljefcai protefri FU10292 12 
H&1316S3 ESTs 22 
Hsl22151 WAA1226 protein 12 
Hs.176376 ESTs 32 

NMJM4 1 12**tomo sapiens ftfcnorhlnophab 32 
Hs.17170 GprotebK»jpledrece^)r4 12 
H&291231 ESTs 11 
HsJ0732 hypcfte6calpfDtenFU134(»;KlAAl711 11 

gbcan03cO3jrl Stmtagenescf^zo brain S1 11 

hflyUD07057^Hofno sapiens ZWIOirrtera^ 11 
Hs235498 hypothetical protein FU14075 3.1 
Hs.188751 ESTs.WeaWysimtertDMAPB JfUMAN MICRO 11 

Hs30512 Hew sapiens mRr^ for WAA0556 protein. 11 

Hs.17949 ESTs 11 

Hs.190535 ESTs 11 

Hs20O7 faimofnecrosb factor (rlQand) superiaml 3.1 

Hs.132136 solute canter tamfly 4, sodium b'carbon 11 

Hs.164599 ESTs 3.1 

Hs.105685 KtAA1688proteto 11 

Hs.156942 ESTs, Moderately ssmBar to AU£_HUMAN A 11 

HsjJ2285 phosptoriosyfg^yciriajnkte forrnyttransfer 3.1 

gknc71f10.s1 NCLCGAPJYl Homosaptens 11 

Hs.47166 HT021 3.1 

gb.-QV4^T0023-16040O-172-d12 ST0023 Homo 11 
Hs.168950 Homo sapiens mRNA; cDNA DKFZp566A1046 (f 11 

Target Exon 11 

H*201550 ESTs, WeaWysitrto to ALUIJiUMAN ALUS 3.1 

HSJ70058 ESTs 11 

gbPM1-HTO422-29129&OQ2-c08 HTO422 Homo 3.1 

Hs.169943 HwnosapieracDr^RJ135©fis,donePL 11 

Target Exon 3.1 

gWMWTTO34a^129W)03c12 BTD340 Homo 11 

Q^xHin^a^h84speiMger^exon12. 11 

Hsl256972 ESTs 11 

Hs.129732 R3H*xiiajh(bto$£^Ie*t^^ 11 

Hs249718 ESTs,WeakryshifotoA46010X-firited 11 

Hs75052B Homo sapiens, clone IMAGEI4098694. mRNA, 11 

Hs.112742 ESTs 11 

gb<Homo sapiens mRNA; cDNA DKFZp566C032411 

K&48778 ntaiproteh 3.1 

H&218377 Homo sapiens cDNA RJ11927 fis, clone HE 3.1 

gfcHcfT©sa4rieredg33mRNA,F 3.1 

gbA«4^T00e2>18020(HX>1^10 STD062 Homo 1 1 

Hs.139226 rejtotkxi factor Cfacfivator 1)2 (40 3.1 

Hs.7043 succrnate^6gase,G^ 11 

Hs£4382 adenosine kinase 11 

Hs.189917 ESTs 3.1 

Hs.173034 err$jMprryBto(S^ 3.1 

Hs.112165 Hornosaplerisc^ajimfis.ctoneMA 11 

Hs.61779 Homo sapiens cDNAFU13591 fis, clone PL 11 

Hs.139336 ATP-hMhg cassette, sub4amSyC (CFTR 3.1 

gbfST96W7Tesfe!H()nx)sapier6c0^5 11 

Hs.106532 ESTs, WeafdyshtQar to (38588 reverse t 11 

Hs.12433 ESTs 3.1 
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448466 H38Q26 
457888 BE219794 
420058 AKD01423 
409248 AB033035 
45Z747 BE153855 
418926 AA232G58 
419346 AI830417 
429826 N93266 
435147 AL133731 
420139 NM.0Q5357 
405609 NA 
404274 

449777 AI971362 
415459 H07118 
415245 N59650 
406291 NA 
414210 BE383S92 
432055 AW972359 
442246 AI791988 
451353 N21043 
451177 AI969716 
418026 BE379727 
401325 NA 
409920 BE169746 
432887 AI926047 
411789 AF245505 
401045 

432162 AA584062 
434627 AQ21894 
442611 8E077155 
425477 AW956879 
457183 H91882 
433014 NMJH4711 
415542 R13474 
416173 R52782 
408155 AB014528 
453913 AW004683 
435495 AI754212 
423629 AW021173 
411836 AW901B79 
415030 D31118 
419606 AW294795 
440310 AA878939 
443608 AT375957 
414140 AA281279 
444781 NMJM44Q0 
435393 AA701259 
454071 AI041793 
446922 BE175605 
448062 AW295923 
427711 M31659 
450362 AA397658 
424866 W01938 
433043 W57554 
420802 U22376 
445625 BE246743 
403677 NA 
411093 BBOS7650 
435255 W87434 
443127 BB681Q2 
448104 AI674818 
427315 AA179949 
430414 AW365665 
423600 AI633559 
458562 N34128 
402109 NA 
429629 BE501732 



arrestin 3, retinal (X-arrestin) 
ESTs 

Homo sapiens cDNA FU1 0561 fe, clone NT 
K1AA1209 protein 
IgsuperfamDy receptor LNtR 
UDF^!ucose^)ya^rotein glucosyttransfe 
potybfomol 
ESTs 



Hs.308 
Hs393471 
Hs.94694 
HsD1965 
H&61460 
Hs.105794 
Hs.44143 
Hs.40747 

Hs.4774 Homo sapiens mRNA; cDNA DKFZp761C1712 (f 3.0 
Hs.95351 " 



3.1 
3.1 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 



Bpase, homjone-sensltive 3.0 

ENSP00000241065 a tCDNA 3.0 

r^002944^Hornosapiemv^ avian UR2 3.0 

H&231945 ESTs 3.0 

Hs.6099 ESTs 3.0 

Hs*27252 ESTs 3.0 

Target Exon 3.0 
gb£01297871Fl NIH_MGC_19 Homo sapiens c 3.0 

ESTs 3.0 

ESTs 3.0 

ESTs 3.0 

ESTs 3.0 

tatty add binding protekU, adipocyte 3.0 

C10000447^I|1168375JspP434€7|AGA1_PED 3.0 



Hs^93334 
Hs.129115 
Hs.42932 
Hs.13034 
HsD3213 

Hs.12504 
Hs.162859 
Hs.72157 

Hs272798 

H&39311 

Hs.177537 

Hs^70535 

Hs.118569 

Hs^79912 

HS29Q263 

Hs.43133 

Hs.78934 

H&21951 

Hs.18612 

H&314453 

Hs.191735 

Hs.198529 

Hs.125406 

H&289074 

H&23317 

Hs.11950 

Hs.189299 

Hs.42502 

Hs£55472 

Hs.180408 

H&60257 

HS337243 

Hs.125019 

Hs.1334 

H&2E&529 



Hs.106015 
Hs.180312 
H&316433 
Hs.175563 
Hs.120368 
HS310359 
Hs.145268 

H&30622 



Qkelyc^okig of mouse Ancadia 3.0 

ESTs 3.0 

DKFZP564H922 protein 3.0 

C11001883W53278|refiNP_033938.1| c 3.0 

hypome%alpic4BhRJ20413 3.0 

ESTs 3.0 

hypofoefoaJ protBh DKFZp761B1514 3.0 

ESTs 3.0 

Dvt-bkxfhg protein IDAX (inhibition of 3.0 

WAA0419 gene product 3.0 

ESTs.WeaWysiinBartoDSO^hypotnet} 3.0 

g b^g99d09j1Soares Want brain 1 NIB H 3.0 

K1AA0628 gene product 3D 

mutS(E.cc^hfflro!og2(cofewcafK»r. 3D 

Homo sapiens Xq pseuooautosoroa) region; 3D 

Homo sapiens cDNAiFLBigOSfis. ctone H 3D 

ESTs 3D 

hypofoefca) protBh MGC1052O 3D 

ESTs 3D 

ESTs 3.0 

F-tat only protein 22 3D 

hypofteM protein RJ14681 3.0 

GPt^nchoiedmetetasls^sscdated prate 3D 

ESTs 3D 

ESTs 3D 
a^C&HTOmi00500D224<07 HT0560 Homo3D 

HAA1843 protein 3D 

solute earner family 25 (mitochondrial 3D 

Homo sapiens cDNA FU13598 fe, done PL 3D 

ESTs, WeaWy simter to ALU7 JiUMAN ALU S 2D 

lymphoid nuctearproteki (LAF-4) mRNA 2D 

wnyb avian myetoWa^ 2D 

hypofteticaipfotBk>FU22635 2D 

C40014625il4887715]gb|AAA79329^ (L088 2D 
gbMW-BnO5^050^)0We0i BT0358Homo2D 

ESTs,MoteiateiysiniartoAUi1JiLW^A 2D 

nflDChoitc^rfbosom^ 2D 

Homo sapiens cDNAFU11375fis, done HE 2D 
Homo sapiens mRNA; cDNA DKFZp564N0763 (f Z9 

ESTs 2D 

ESTs 2D 

ESTs 2D 

Target Exon 2D 

Homo sapiens cDNA FU13010 fis, clone NT Z9 
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442296 


AIR7794A 

IXtttCt £rtQ 




Homo sapfens cDMA RJ11469 fis, clone HE 


2.9 


419752 


AA24S573 


Hs. 15261 8 


ESTs, Moderately ^^rtoZN91JiUMAN Z 


Z9 


404721 






NM_CQ5596*:Homo sapiens nuclear factor 1 


19 


445107 




Ha. 147313 


ESTs. WeaWy stosterto 138022 bypofliefl 


2,9 


401987 






NM-002737**>lDmo salens pcoteln Wnase C 


Z9 


40U300 


AA4JH9A9 


Uxt 1QTHAQ 
na.iSfU149 


ESTs 


Z9 


444517 




H&14fififl3 


ESTs 


23 




AWR7%flfi 
HrfOr JOUD 


I19LIWUUU 


ESTs 


23 




AW1Q449R 


Lie nrrnc 


ESTs 


23 




MJ0O0O04 




ESTs,WeaWysfcnaartoT32250hypo»^ 


19 


www 






TaigetExon 


23 






Uq 4,fY19QQ 


cu&i5 


23 


4O90U4 






gb£01315974F1 NlH_MGCJB Homo sapiens cD 25 


4Z4gu 


MM MVXtft 


nS,10Jw4 


TRAM-fiteproleh 


23 


4ttxnr 


DDUUj/DU 


Ks.55209 


Honiosap^ntf^cDNADKFZp434K0514a2.9 


410991 




Lie OAJOC 


adfoose most abundant gene traftsci'ipl 1 


23 


41o£cl 


DC4C44C4 

DC10T191 




gbJ 5 MOHTD4»-14l29W)0VF08 HT0425 Homo2.9 


Jincrrto 
4U3# JZ 




Uq CC4 ad 


NY-REN-58 antigen 


23 


4o30or 






glmy57g01^l Na^CGAPJYIS Hmro sajfens 2.9 


434340 


AI193043 


nS.iZoOK> 


ESTs, WeaWy s&nlar to T17226 hypotofi 


23 


494029 


Z45439 


rlSJc/U4£> 


ESTs 


23 


421379 


Y15221 


nS»1039B2 


smal Inducible cytokine autfamOy B (Cy 


23 


452291 


ArU15o9Z 


Hs*28fK>3 


CDC7 (eel division cycle 7, S. cerevlsl 


2.9 


457402 


AW452dw 


Un 

nS. 149342 


activation-induced cy&fine deanAiase 


23 


449051 


AW9614W 




HTO nxeptor tyrosine kkiase {o&MxZ 


23 


408761 


AA057264 


Hst238936 


ESTs, Weatty stmBar to (deftie not ava 


23 


401093 






C12000586^gq6330167|dbpAA86477.1 1 (A 


23 


435061 


AI651474 


rB.163944 


ESTs 


23 


447985 


AI681475 


n&2UU949 


ESTs 


23 


449340 


AW235786 


Hs.195359 


hypofhefical protein MGC10954 


23 


426384 


AJ472D78 


Hsl303G62 


ESTs 


23 


411905 


BE265067 




gbc601 193893F1 NlHJb<GC_7 Homo sapiens cO IB 


405953 


UA 

NA 




Texget Exon 


23 


420854 


AW2969Z7 




ghAJWWVVaajw>07-4>4jU1 NCLCGAP.Su 


23 


434265 


A ACMC044 

AAo46o11 


KS.13D354 


Homo sapiens cONA: FU23089 fis. clone t 


23 


428365 


AA295331 


Un 4MHC4 

HS.i83od1 


Homo sapiens cDNA RJ20042fis t doneCO 


23 


442861 


AAZ43B37 


Hsl57787 


ESTs 


28 


448937 


AW20D40J 


Un 97DO 


ESTs 


23 


452554 


AW45Z434 


Un conne 


ESTa, Weakly sfrrto to AU^JiUMAN ALUS 


23 


41Z24B 


Bel 76460 




gW^C34fr0585-1603(MM>22-cQ2 HT0585 Homo U 


44340U 


Al MOOE9 


Ub 1MQC 


ESTs, Moderately sfmaarto AUU7_HUMAN A 


18 


434757 


ARJ3B99/ 


Urn 443004 


ESTs 


18 


408038 


(9/490 


Un OWB 


smal Inducible cytokine subfemSy A (Qf 


IB 


404340 


AVVoUCxKrI 




gb^^T0023-160400-172-c12 ST0023 Homo Z8 


439942 


AidlOoSo 


Ut. 404440 


ESTs 


18 


4ZB4f3 




U» 4Qif%70 


eel dMsbn cyde 2, G1 to S and G2 to 


18 


4*MWf 


Ai mccso 




MyoDfiarnBylnhfiAor 


18 


411W0 


DCU/UOUU 




gb«C3OT)502-25119^011-c07 BT0502Homo 2A 


4UUQU 


NA 




Eos Control 


23 


4491t» 




LU4<14M 

nSxJl4£ 


colon cardnoma reSated protein 


18 


40D4CK 






gbaa41b12^1 NCLCGAP^GCBl Homo sapiensZS 


42i044 


AA502490 


Hs»336695 


ESTs 


18 


431854 


AA383550 


Hs^71699 


polymerase (DNA (Greeted) iota 


18 


405873 


NA 




TaigetExon 


18 


440400 


AA994364 


Hs.125594 


ESTs, WeaWysimaar to 725472 hypolhefl 


18 


458265 


AI075375 


Hs.128193 


ESTs, Wea«ystoiBarton«LHUMAN ROQU 


18 


413708 


BE158791 




gb:L2-HTO397-0912»4a254X2 HT0397 Homo U 


423739 


AA398155 


K&97600 


ESTs 


16 


424408 


AI754813 


Hs.146428 


coQagerv type V, alpha 1 


18 


453096 


AW294631 


Hs.11325 


ESTs 


18 


421825 


AA298758 


Hs.183747 


ECTs.Mooeratelyshn»artoCAiBJMV^C 


18 


417742 


R64719 




gbfST22d11 WATM1 Hc4TOsapten3c0NAcion2.8 


402765 






C1OX3621^gi712407405teb(M(^91.1|AF22 23 


444378 


R41339 


Hs.12569 


ESTs 


18 


419172 


AW338625 


Hs.22120 


ESTs 


IB 


401497 






Target Exon 


18 


402376 






Cf 900076^^363912|pi7l|JC4ffl6 ring f 


23 
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405041 
408758 
431917 
437583 
453737 
458094 
401283 
410784 
417601 
418236 
435532 
454714 
418629 
4421C1 
405080 
414681 



429638 
428824 
421566 
4145S6 
440868 
452943 
443772 
432361 
430375 
406504 
423279 
424871 
453619 
423961 
422156 
415752 
419987 
406182 
416495 
444701 
408171 
430153 
413383 
414831 
413278 
433132 
437030 
439031 
449532 
406153 
406625 



NA 

NMJXJ3686 

D16181 

AA761190 

AA744862 

AF086325 

NA 

AW803201 

ML014735 

AW994005 

AW291488 

AW815098 

BE247550 

AI651930 

AKD00375 

T97401 

AJ550833 

A1916662 

W23624 

NMJD00399 

BE386870 

R79707 

BE247449 

AV646449 

A1378562 

AW371048 

NA 

AW959861 
NMJ04525 



432328 
429628 
420149 
431207 
438394 
443304 
427660 
408460 
416515 
429822 
418203 
439509 
402184 



451963 
4579ffl 
441541 
441111 



N34524 
BE314524 
NM.005O14 
NA 

X69970 

A016512 

AA301228 

AW968128 

AA128976 

M31158 

BE563085 

ABQ26264 

AA742577 

AF075079 

W74653 

Y13647 

AH88139 

AB7Z739 

KD9604 

AA255920 

AA495925 

BE379623 

AJ050073 

AI741320 

AA054726 

N91716 

297630 

X54942 

AF086332 

NA 

AW449251 

AIB25440 

AI373638 

AA938663 

A1806867 



C3001706^1345652|5pP15989|CA36.CHIC 28 

Hs.47504 ©©nuclease 1 28 

Hs2B68 peripheral myelin protein 2 2.8 

H&244627 ESTs Z8 

Hs.194293. ESTs, Weakly simBarto 154374 gene NF2 28 

gb^omo salons KAIengfi)bsertcDNA 28 

Target Bcon 2.8 
gb^Amn7mmm^^[MXfnHomoZB 

KtAA0215genepmduc( 28 

ESTs 28 

Homo saltans, ctae IMAGE3SI2908, mRNA 2B 

gb:QV4-STtl212-09119^O23-f10ST0212Homo 2.B 

growth factor receptor-bound protein 7 2B 

ESTs IB 

HDCMC28P prate*) 28 

ESTs 23 

Homo sapiens cONA:FU23031fi3; done L 28 

ldrecon1(khesin receptor) 2.7 

ESTs 27 

early growth response 2 (Krox-20(T^^ 27 
gbJ601275271F1 NWJ«3CL20 Homo sapiens c 2J 



Hs32292 
Hs337534 
Hs.117305 

Hs.86859 
Hs.135684 
HsJ6820 
Hs^1929 
Hs.143688 
Hs£11577 
Ha173Q59 
Hs.1395 

Hs363339 

Ha31082 

Hs3B2872 

Hs.159585 

Hs*3758 

1&290943 
Hs.153595 
H&33922 
Hs.136348 

Hs.78776 
Hsl94070 

Hs.79350 

Hs.198394 

Hs.43299 

HsJ36679 

Hs.154706 

Hs77439 

Hs*33 

Hs284245 

Hs^03781 

HsZ71593 

Hs.1 19597 

Hs.147050 

Hs.195471 

Hs.13268 

Hs£8085 

Hs3394 

Hs27683 

Hs.135338 

Hs.114121 

Hs285574 

Hs.194140 

H&226117 

Hs33758 

H&58314 

Hs357131 
Hs224952 
Hs*133900 
HS.199B28 
Hs.126594 



ESTs. Moderately sonfer to 138022 hypot 
hypoSieficat proteh FU10525 
ESTs 
ESTs 

H4 Wstooe tarnSy, member H 
(S0(X^^4504675tn#lPJJ02175.1 1 int 
ESTs 

kw density Bpoprotern-rebfed protein 

Homo sapiens, done MGC3084, mRNA. comp 

hi In nl-ilii nl rin ri nfflr» f mIj ■ ^ /gimjiLJli. 

OSfPv t'T fffff SPCCnPF khJPw * ItaSuCul 

gfacyy56d103l SoarB^jnu^lejsderosls. 
putaSve transmembrane protein 



Target Exon 

RYK receptor-Ike tyrosine kinase 
ESTs 

hypothetical protein FU128&) 
ESTs 

hypoftefcal proteh RJ14917 

protein kinase, cAKP-dependent regulato 

(nterferon-s&nubtBd protein, 15 kOa 

hypoftefical pioteh IMPACT 

EST 

gfcHomo sapiens fill lengm tosert cDNA 
ESTs, Moderately similar to A47582 B<el 
Tamer. Exon 



ESTs 

6fhosphomictD4tirase/^^ 

ESTs 

ESTs 

ESTs 

pep8dytiaoryt isomerase (cydophiBnH 
ESTs 

Homo sapiens cDNA: RJ23228 fis, done C 
ESTs 

ESTs, WeaJoysimSar to 138022 hypdhefi 
HI histone famBy, member 0 
CDC28 protein kfoase 2 
ESTs 

ENSP00OXKM5238*rt»NA FU10922 fis, don 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 
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27 
27 
2.7 
27 
27 
27 
27 
Z7 
Z7 
Z7 
Z7 
2J 
Z7 
Z7 
2.7 
2J 
2.7 
Z7 
Z7 
Z7 
Z7 
Z7 
Z7 
27 
27 
Z7 
27 
27 
27 
27 
27 
27 
27 
27 
27 
27 
27 
27 
27 
27 
27 
27 
27 
27 
27 
27 
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44S234 
413903 
406069 
447410 
401256 
415139 
420218 
455611 
438825 
409564 
452837 
434876 
421565 
453279 
430765 
45S986 
433068 
421952 
429208 
430733 
441720 
418986 
432481 
434338 
417061 
410530 
4K672 
425071 



451531 
405822 
418301 
417315 
434699 
443204 
405638 
452542 
403943 
404535 



AA383092 

AV653485 

AW293165 

X91662 

AW137636 

AM96493 

NA 

AM70235 
NA 

AW975942 

AW958037 

BE144762 

BE327427 

AA045857 

AL121053 

AF160477 

AKD01122 

AW893940 

Z302O1 

D38299 

NH.006456 

AA3QQ900 

AA447990 

AW975920 

AB46487 

A1123555 

AW451645 

AW754311 

AIS75944 

M25809 

AKD02016 

NM.013989 

AW292286 

AA018311 

AW976201 
A108C042 
AA643687 
AW205878 

AW812256 



Z25884 

. NA 

45 449144 AI989503 
454934 AW846080 
424717 K03754 
428303 AW974476 
427970 AA418187 
50 450638 AKD01826 
BE246010 
AW655802 
ABQ28955 
NMJH5434 
247542 
NMJ000163 



453034 
455097 
427317 
408875 
427510 
423201 
406271 
442696 
454018 
435420 
434398 
455708 
439347 
407523 
425101 
435153 
409139 



AW016892 

AI928513 

AA121098 



W24320 

X64984 

AA830431 

AA668763 

AJ881917 



Hs.1608 
Hs.6390 
Hs.143134 
H&66744 
Hs.146059 
Hs23136 



repficaiion protein A3 (14kD) 2J 
Homo saptens ctena RB3344 PRO0845 mRNA. 2.7 



ESTs 2.7 

twist (Drosophfta) homotog (acrocephabs 2.7 

ESTs 2.7 

ESTs 2.7 

TaigetExon 2.7 

Hs.172698 EST 2.7 

NM^024089^omosaptohypofl)^pro 2.7 

Hs.48524 ESTs 2.7 

H&286 ribosomalpmteinU 2.7 
glxCMDW018W)4109WI6W)04 HT0180 Homo2.7 
ESTs 

fracture cafes 1 (raf) tomofog 
Horm> sapiens cONAFU12961f^ clone NT 
tgsupeifa^ receptor LN1R 
hypoftefcaJ proteto FU10260 
ESTs 

gbflHEA22G Atom cDNA flbrary Human hea 
prostaglandin E receptor 3 (subtype EP3) 



Hs.79953 

Hsl54943 

Hs5534 

F&61460 

H&1Q5B59 



Hs,170917 
Hs388215 
HsS8849 
Hs.190478 
H&283361 
H&28739 
H&8179S 
Hs.151504 

Hs.168691 
Hs54173 
Hs.114727 
Hs.154424 
H&255058 
Hs.114762 

Hs53913 
Hs336901 
Hs.149425 
H&29643 



ESTs. Mode*afe)y^mnartoAF161511 1 H 

ESTs 

ESTs 

ESTs 

ESTs 

Homo sapiens cDNA FU11973 fis, done HE 



2.6 
ZB 
2.6 
2.6 
2.6 
Z6 
16 
2B 
ZB 
ZB 
2B 
2B 
2.6 
2.6 
2B 



Hs.121483 

H&233405 
H&314324 
Hs.152213 
Hs.183601 
K&330515 
Hs^5245 
H&271468 

Hs.175780 
Hs.48604 
Hs.179312 
Hs.125180 

Hs.7063 
Hs.100855 
H&59203 
Ha3838 

Hs.102941 

Hs.180811 
Ks^91939 
H&3321 



gW^1-CT0337-14129W)6W07 CT0337 Homo 2.6 

Horro sapiens cONARJ12033tls, clone HE Z6 

ATPase, H transporting, lysosomal (vacuo 2.6 

Homo sapiens, to M^:16327,rnRh^ com 2B 

dalodfrase, tocfcthyrorune, type U 26 

ESTs 2.6 

ESTs 2B 

Target Exon 2B 

hypc*eik^pnrteinRJ1Q252 2B 

ribosoma) proton S24 2.6 

Homo sapiens CDMAFU11980 6s, clone HE 2£ 

Hccrosaplefis(£>rMFU13103 ketone NT 2.6 

Target Exon ZB 
gbAC^STO174-i910994)31-a07 S70174 Homo 2.6 

C50003SS^4503225|reqNP M 0007G5Ll| cyt 2.6 

chtaktecbajmeH.stelettrn^ 2.6 

Target Exon 2.6 

ESTs 2JB 

ESTs 2B 

wingless^ Mm Weg^ta site laml ZB 

regthalorof G-protehsignaOktg 16 2B 

ESTs ZB 

hypo&teQcat protein FU11269 ZB 

Hera sapiens mRNA (or Fl^X)038 protein, 2.6 



gfcRC1<rra279-17Q200M>23^ CT0279 Homo ZB 

WAA1032 protein ZB 

DKFZP434B168 protein 2B 

sn^rwdearRNAacfiva^amiple^po 2B 

growth hormone receptor ZB 

Target Exon ZB 

Homo sapiens cDNA:FU20913 fls.ctone A 2B 

ESTs 2B 

ESTs 2& 

^pfifffrfrwftiriMfr K^ 3 ^* 2.6 
gb^VMT0381-17010O«6&gO3 BHTO81 Homo 2.6 

Homo sapiens cDNA:RJ21531 8s, done C 2.6 

gteH^ap^rriRNAHTPCRXIOforofect^ 2.6 

ESTs 2.6 

ESTs 2.6 

ESTs, Highly s&nferto IRXIJttJMAN IROQU ZB 
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455100 
414612 
440283 
423025 
431473 
404440 
403388 
403775 
405037 
407447 
420952 
435447 
440202 
445854 
421247 
414B7D 
457411 
424676 
404443 



430832 
444779 
406633 



447197 
454111 
411465 
406922 
456045 
413111 
423123 
432201 
427032 
445417 
4 22223 
428330 
410011 
426310 
419589 
437770 
442818 
414251 
428301 
431956 
455732 
458624 
428257 
416064 



BE160198 
BE274552 
AI732892 
AA831267 
AA825686 

HA 
NA 
NA 

AF290544 

AA282067 

AI872932 

AW516211 

AJ702885 

BE391727 

N72264 

AW085961 

Y08565 

NM_003512 

A1073913' 

AJ192105 

AW963372 

F13036 

R36075 

AW081681 

MMUJ00169 

S70284 

H62943 

BE065837 

NM.012247 

AI538613 

AF012023 

AKD01058 

BE245652 

L22524 

AB020641 

NM.00O9O9 

AW973708 

AA767881 

AKD01741 

AL042306 

AW628666 

AKW2Q32 



gh^14m41W»1Oa)W»5^h03HTD413 Homo 2.6 

H3.76578 protein Inhibitor of acfivated STAT3 2.6 

Hs, 190489 ESTs 2.6 

Hs.12244 hypothetical pro to BJ20097 2.6 

Hs.321176 ESTs, Weaty similar to S65B24 reverse t 2.6 

NM_Q21048*tomo sapiens mefenoma antigen, Z6 

C3001398^gi|12248917|dbpAB20375.1| (A 2.6 

TaigetExon 2.6 

NAU^^HomosapierBarachUonatet^ 23 

gfcHomo saptens amtoopepiidase mRNA, par 23 

Hs.88972 ESTs, Moderately simter to A46010 X-fci 2.6 



Hs.125300 
Hs.145568 
Hs.102910 
H&300670 
Hs.130093 
Hs.151678 

Hs^8777 
Hs.100686 
Hs.147170 
Hs.46677 
Hs^7373 

Hs369064 
H&39089 

Hs.154188 

Hs.124027 
H&298241 
Hs.173274 
H&12680 
Hs.118281 
H&2256 
Hs57856 
Hs.169266 
Ha2M925 
Hs.122897 
H&87& 



AB62790 
BE394723 
rAL0Q2318 
AW375610 
50 458012 AJ424899 



410804 
457211 
440029 
55 448141 
409163 
431385 
414569 



60 
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455935 
425025 
416569 
404826 
422838 
421991 
417404 
448516 



U64820 

AW972565 

AW089705 

AI471598 

AA065081 

BE178536 

AF109298 

AW247529 

BE158687 

AW953168 



NNL001809 
NHL014918 
NW-007350 



gbnwn72e03jc1 NCLCGAP_UI2 Homo sapiens 23 

i ringfe)gerproten21 > trtofeiorHespo 23 

ESTs 2.6 

genetaJtraieoiption factor IH polype 23 

HAA1204proteh 2.6 

ESTs 23i 

UDP4*acetyte!pha4>^^ 2.6 

C8001428^gt|6572242|etnb|CAB62951.1| (Z9 Z6 

H2A WstonefemRy, member L 2.6 

ESTs.WeaMysiriBTartoJB0350Arfertor 2.6 

ESTs 23 

PRO2000 protein 2.6 
Honw sapiens mRNA;cO^OKFZp56401763(f 2.6 

gb.7h88b01^1 Scares pbcerdah!b2HP Homo 23 

ESTs, WeaWyslntotoT42689hypo8jefi 23 

gatoctosidase, alpha 23 

gfcstearoyt-CoA desaturase [human, atfyo 2.6 

ESTs 23 
gb:RC2-BT0318-1 10100412^12 BTD318 Homo 23 
SELBJOPHOSPHATE SYNTHETASE ; Human setenZ6 



Transmembrane protease, serine 3 23 

totegito cytopJasmcdcfrafress^ 23 

Hcfno sapiens cDNAFU10i96 lactone HE 23 

zinc finger proteh 266 23 

rratrixrnetaDoprotemase7(MMP7;ute^ 23 

PFTAIRE protein kinase 1 23 

neuropepftteY receptor Y1 23 

Homo sapiens cDNA RJ1344S 8s, ctone PL 23 

ESTs 25 

hypometfca! protein RJ10879 23 

VASA protein 23 

Hs.98440 ESTs,Wea)(^sbTdartol38022hypothe6 23 

Hs*272245 Homo sapiens CDNARJ11170 fis,ctone PL 23 
gb^-BrTOK1-280200484W7 BTD631 Homo 23 

H&278639 K1M1684 prc4eln:^rrt)rootog rfrrous 23 

HsJ75243 S100cakxi»biF)dtogpn)teinA6(cak^ 23 

Ks33354 lysyf coddase-Ooe 2 23 

H&117102 hypofte^pR^FU13046sim3artD Z5 

Hs.188211 ESTs 23 

gbd3M4-BTOTO4-13040(H)0^ 
Hs.66521 Machado^lcseph disease (spfr«x»n^ 
Hs32399 ESTs,WealdyslrrtotoS51797v8S0d3al 
HsJ93711 ESTs, WeaJdy similar to S64329 probable 
Hs.197531 ESTs 

gtxzm13a<&s1 Stratagene pancreas (93720 
Hs.11090 membranwpanntog 4Ktomair^ 
Hs.116258 prostate cancer associated protein 1 
Hs3793 pteteiet*c8vafirfl factor*^ 

gW»«mT0395-28(«0O-16Wj09HnS95 Homo23 

Ha.12407 ESTs 23 
Hs.96151 HurrOTDNAse^»x» mom done RP5-1103G7 23 

Target Exon 23 

Hs.1594 centromere protein A (17MD) 23 

Hs.110488 KIAA0990 protein 23 

Hs32101 ptecfcstrhhwnolagy^ 23 
gb^C1-NN0073>26O40(W)11<i09 NN0073 Homo25 
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418282 AA215535 
427409 AW467143 
431806 AF186114 
443367 AW071349 
421246 AW582962 
439217 APD86041 
400925 
404552 NA 
417168 AL133117 
418841 NMJH2332 
426853 U32974 
427738 NM.000313 
457384 AA5O1760 
447128 AE71898 
454693 AW813428 
434657 AA641876 
402077 NA 
400289 XD7820 
.409723 AW885757 
447020 T27308 
455068 AI807894 
431232 A1Q24353 
408938 AA059013 
411571 AA122393 
426504 AW162919 
428248 A1128772 
408813 AI580090 
423504 N80077 
425441 AA449644 
443066 AW297921 
443556 AA256769 
428943 AW0861B0 
425320 U29344 
430388 AA356923 
423242 A10394Q2 
416241 M52539 
440244 AI743977 
409239 AA740S75 
452464 AW50Q507 
410718 AB20783 
408877 AA479033 
445150 AI446747 
407756 AA116Q21 
407633 KMJW7CS9 
449754 H00820 
419316 AA236255 
429118 H20669 
440331 AI046412 
449344 A1640355 
459008 AW298631 
423165 AI937547 
411337 AW837349 
438290 AA843719 
408414 

424498 AB033043 
443464 BB48446 
424856 AA347746 
440304 BE159984 
409045 AA635062 
.422648 D86983 
428819 AL135623 
412520 AA442324 
430602 D13752 
408031 AA081395 
403133 



Eh<SP00000251525^Hypoth8t{cai protein W 25 

B4SP00000252242*^efa^,lyp8n^tosk 25 

H&98133 ESTs 25 

Hs.135411 acfin related protefo 25 

Hs270737 tumor necrosis factor (Pgand) superfami 25 

Hs215937 ESTs 25 

Hs.102897 CGM7 protein 25 

Hs.42975 ESTs 25 

TaigetExon 25 

ENSP00000220888*-.Z1NC FINGER TRANSCRIPI125 
Hs.81378 Homo sapiens mRMA;cDNADKFZp586L1121{f25 

Hs£9137 towdens^DpoprotBfHBta^prDteV) 25 

Hs.172777 b3CutovMWn^eak»ntafri)ng4 25 

Hs.180612 peroxisomal membrane protein 3 (SSkD.Ze 15 
Hs.15806 Hero sapiens mRNA;cONADKFZp4MH&19fl 25 

Hs.164866 cycfin K 25 

gfacMtt-STO192O1O20O^OcO5 ST0192 Homo 25 

Hs.191840 ESTs 25 

TaigetExon 25 

H&2258 nratrtxmetaltoprofemaselO (MW>10;str 25 

Hs357862 ESTs 25 

Hs.16986 hypotheOcal proteki FU11W6 25 
Hs.47274 Homo sapiens mRNA; cDNA DKFZp564B176 (fir 25 

Hs.131755 hypofoettad proteki RJ14298 25 

H3L22607 ESTs 25 

Hs.70811 hypo&>e8calprot^RJ20516 25 

Hs.170160 RAB2; roemberRAS oncogene fem^fflte 25 

Hs.40479 ESTs 25 

Hs.48295 RNAteficasefemfly 25 

Hs24792 chromosome 12 open reading frame 5 25 

Hs.193063 Homo sapiens cDNA FU14201 fe, clone NT 25 

HSt255703 ESTs 25 

Hs£4949 methykralonyW)oAepanerase 25 

H&37636 ESTs. Weakly similar to K1AA1 392 protein 25 

Hs£3190 tafiyacidsyntae 25 

Hs^40770 mic^capbJna^proteinsubuna2 t 2 25 

Hs.125783 DBAE-6 protein 25 

Hs32683 ESTS 25 

Hs205144 ESTs 25 

Hs.44307 ESTs. MaleratelysMar to 138022 hypot 25 

Hs.192619 KIAA1600proteb) 25 

Hs.191435 ESTs 25 

Hs.130315 ESTs,Wea)dyslmaartoA47582B<»flgr 25 

H&338704 ofec^ receptor, tamBy 7, subtan% 25 

Hs38260 ubio^spec& protease 18 25 

H&37189 sbnBartoratHREV107 25 

Hs30977 ESTs,Wealdys^toB34C87hypoliefi 25 

Hs298419 ESTs 25 

Hs35406 ESTs,Hlgh^shtetoiHirtaniedp(otein 25 

H&2Q2151 ESTs 25 

H&312691 ESTs 25 

Hsl27721 WtoWirscrihomsynoVom^ 25 

Hs.124915 hypofoefcal proteto MGC2601 25 

gbtf mTO038427O3O(M0&<t12 LT0038 Homo 25 

Hs.122341 ESTS 25 

C5000506 i igQ124941^ppi8614[rrA1_RATf 25 

Hs.149377 rtypotefoal protefc DWFZp761U0424 25 
Hs5167 Hofi»sapieisinRNA:cDNADKFZp434F152{fr 

Hsl9521 ESTs,WeaWysiitotoZN43JiL»WNZWC 25 

Hs.125395 ESTs 25 
H350094 Homosapkrerni^c£MDKFZp4* 

Hs.118893 Melanoma associated gene 25 

Hs.193914 KIAA0575genapnxfajct 25 

H&795 H2ANstDnefan%,menteO 25 

Hs.184927 cytochnmP450.sub^)QB{sterok) 25 

Hs.42173 Hccno sapiens cDNAFUl 0366 8s, clone NT 25 

TaigetExon 25 
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413189 88170231 gbXM-OTW-260100-087-f12 BTO407 Homo 2.5 

40034$ AB041269 Hs#2263 Homosap^mRNAfofkefafin19 f pafe 23 

435509 AI458679 Hs.181915 ESTs 23 

458145 A1239457 Hs.130794 ESTs 23 
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Table 19A shows the accession numbers for those pkeys lacking unigenelD's for Table 19. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Pkey: Unique Bos protest Member number 

CAT number Gene duster number 
Accession: Genbank accession numbers 



Pkey 

407647 
407980 
408254 



409685 
410534 
410672 
410784 
410785 
410835 
411050 
411086 
411033 
411111 
411171 
411337 
411514 
411670 
411905 
412102 
412209 
412248 
413043 
413111 
413189 
413221 
413499 
413708 
414210 
414596 



1007366.1 

103087J 

1049346J 



10 



15 



20 



25 



30 



35 409163 110418.1 



CAT number Accessions 



114876.1 

1207247J 

12M882J 

1221005.1 

1221055.1 

1223785.1 

1230330.1 

123150OJ 

1231970.1 

123266SL1 

1234393.1 

1239217J 

1248638.1 

1253680J 

1265181J 

1277395J 

1283610.1 

1285000.1 

1346556.1 

1349546.1 

1352723.1 

1353887.1 

1373910.1 

1384140.1 

1426051.1 

1465004.1 



AW860158 AW862385 AW860159 AW862386 AW862341 AW821869AW821893AW0S266OAW062656 
AA046309AKS63500AA046397 

AW807227 AW807576 AW807137 AW807157 AW807495 AW807494 AW807417 AWB07083 AW845786 AW845801 
AW807130 AW807335 AW807081 AW807349 AW807339 AW807164 AW807341 AW807224 AW845903 AW177424 
AW807159 AWB07123 BE141576 AW807340 AW807334 AW807520 AW807205 AW807505 BE141574 AW807390 
AW807396 AW845789 AW807101 AW8O7D89AW8O7519AW807239AV^5O9AW^^ 
AW807307 AWB07153 AW807295 AW807313 AW807322 AWB07265 AW807513 AW807516 AW807166 AW807501 
AW807120 AW807168 AW807121 AW807527 AW807151 AW807203 AW807489 AW807511 AW807158 AW845800 
AW807507 AW845793 AWB45796 AW807296 AW845781 AW807444 AW845871 AW807512 AW807242 AW807141 
AW807522 AW807487 AW807514 AW807142 AW807232 AW807379 AW807114 AW807518 AWB07199 AW807211 
AW80749B AW807086 AW807492 AW80721 8 AW807D82 AW807525 AW807493 AW807523 AWB07087 AWB45784 
AW807037 AWB07128 AW807080 AW807118 AW845807 AW807524 AW845803 AW807249 AW845795 AW807160 
AW807343 AWB07515 AW807233 AW8072B9 AW177102 AW807352 AW807394 AW177105 AW807175 AW177103 
AW845870 AW177099 AW177101 AW807528AW807338 AW807038 AW177100 AW807411 AW807088 AW845865 
AW807226 AW807517 AW807397 AW807303 AW807177 AW807154 AW807136 AW807146 AWB07085 AW807521 
AVV807488AW807385AW807355AW807223AVV807155 

AA065081 AA075017 AA084791 AA071015 AA081560 AA071459 AA545727 AA083100 AA085366 AA115845 AA075457 

AAD64704 AA082878 AA075742 AA0691 62 

AA2S6961AA296889AA076945AA077528AAO77497 

AW905138AW753008 R13818Z43519 

AW794600AW794730 / 

AW8Q3201BB079700BE062940 $ 

AWB03341 AW803265 AW803403 AW803466 AW803402 AW803413 AW803268 AW803396 AW803334 AW803355 

AW806906AW806915AW866460AWo^^ 

AW8149Q2 BE156656 BE156667 BE15G590 BE156441 BE156447 

BE070800 AW875226 BE149115 

BE067650AW817053 

AW818127 AW818161 R09719 

AW820260AW820332R94406 

AW837349 AW837355 AW882717 

AW850178 AW850233 AW850445 AW850446 

AW856552AWB61101 AW856574 AW861099 AW861100 AW856573 AW856576 AW856562 
BE265067 BE264978 AW875420 
H56435 H56572AW892929 
AW901456 AW901450 AW901441 
BE176480 AWS03298 AW903313 

BE158766 BE061699 BE147360 BE147362 BB061666 BE081697 BE061647 BHB1678 
BE065837 BE065805 BEQ65799 BE065818 BE065839 BB065831 BBJ65894 BBQ65789 BE065792 
BE070231 BE070229BB070255 

BE161151 BE162495 BE1610Q2 BE072205 BE160989 BE162482 
BE144884 H97942 

BE156791 BE158806ffi158748BE158744BE158740BE158739BE158811 BE158700 BE158741 BE158683BE158685 

BE383592BE281671 

BE386870Z41986H08501 
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414605 


1465790.-1 


Bc39D440 




415747 


155189.1 


AA404<Vtn AA404QJC A A 4C7CQO 

AA3B12Q9 AAoBiZ45 AAlb/bo3 




416173 


1574973J 


nPWOT D4T944 LPM40*) D4O0TC 

R5Z782 K173 13 HZ419Z KlSofo 




417742 


1696282.1 


R64719 Z44680 R12451 


5 


417974 


171237.1 


AA21D7oo 135700 H944U7 




418636 


1774Q2J 


*uwjftfler *A*yuafMc Atirrc/vmo AlArrc/vonC 

AW749855 AAZZ59S5 AW75U2Q8 AWr SUZUb 




419536 


185688J 


AA6033Q5 AA244095 AAZ44183 




420854 


197072.1 


AW296927 AJ684514 AI263168 AA281079 




422156 


212379J 


N34524 AA305071 AW954803 AA5QZ335 AI433430 AuD359f AWUZbO/U AWZt»3Z3 AVvo00/o/ AW1 rm AW&uov 


10 






AW835572AW385512AI334966W32951 H6a656 ro390Z R88904 AWB35/3Z 




422996 


223G66J 


68191089 BE091123 AA319959 




423833 


232451J 


AW503329 M46610 AA331571 




423841 


232507J 


AW753967AA370795AA331630AW962550 




423945 


233565J 


AA410943 AW948953 AA334202 AA332882 


15 


425201 


247933J 


AA352111 AWS62247 AA429695 




426650 


270283J 


AA382814 AA402411 AA412355 




426878 


273265J 


BED69341 AW748403 AL044891 AI908240 AA393080 




430264 


315008.1 


AA470519 BE303O1O BE302954 BE384120 




430785 


323486J 


Z30201 AA486132T72025 


20 


431676 


336411J 


AI685464 AW971336 AA513587 AA525142 




433687 


373061.1 


AA743991 AA604852 AW272737 




434338 


383982J 


AW754311 AA630185 AW803285 




434469 


387447J 


AA634806 C18732 AA729161 AA729860 




435447 


406400J 


A872932AA682306 BE220163WB8695TB1307 H91447 


25 


437152 


43386.1 


AL050Q27BE089051 




437854 


44418.1 


AL119723 AL119874 A1909018 U50537 




439031 


46798.1 


AF075O79H486O1H48795 




439255 


470321.1 


BE164500 AA832198 BE164502 




444910 


624951.1 


AI201849 BE0690O7 AW946544 


30 


445432 


63943.1 


AVG53771BE089370 




446922 


69865L1 


BE175605 Z43529 F06610 BE175602 AV661027 




447197 


711623.1 


R36075AB66546R36167 




448420 


76273.1 


BG623004 AA380669 BE263627 BE246433 




448516 


766241.1 


AW893595 AW89858B AW898590 AW898663 AW898592 A1525093 


35 


450522 


837264J 


AJ698839AB09260A1909259 




450736 


844652.1 


AW970060 AI73Z366 AI792313 AW839644 




451024 


85565.1 


AA442176AA259181 




451067 


85759.1 


BE172186 AA059279 AA02D&15 AA013437 




451340 


86640.1 


AW936273 AW340350 AA017208 


40 


452542 


921410J 


AWB12256 AW812257 AB06423 AI90&422 




452564 


92227.1 


AA026777 N50065 R09961 N54721 




453472 


968371.1 


AL037925 AL037931 ALD37957 




454307 


1106070 J 


AW855717 AW362452 AW362443 




454359 


1130674.1 


N71277AW390764 


45 


454545 


1223779.1 


AW806899 AW866451 AW866393 AW866297 AW817869 




454693 


1229132.1 


AW813428 AWB13444 AW813367 AW813368 AWB13429 AW813424 




454714 


1230493.1 


AW815098 BE154643 BE 154831 




455047 


1250536J 


AWBS2530 AW852527 AW85S26 




455092 


1252971J 


BE152428 AW855572 AW855607 


50 


455097 


1253130.1 


AW855802 AWB55794 AW855797 AW855806 AW855796 AW855808 AW855733 AWBS5BU/ 




455100 


1253334.1 


BE160198 AW935898 T11520 AW935930 AW856073 AW861034 




455431 


1ZB9B54_1 


AlMMOjtQl DCfWIOA^ RPfVHIQft 
Airg«)o<Hn DCUUilw 




455511 


1321229.1 


BE144762AW979091 




455609 


1337548.1 


BE011668 BE011689 BE011627 BB011679 BE011699 BE011678 BBH 1 696 BE01 1675 BE01 1622 BB011635 


55 


455651 


1348732.1 


BE064962 BE084979 BE064853 BB084857 BE064856 BB064977 BE064 960 BE064860 BE064815 BB064957 BE064804 


BS064816 BB064850 BE064806 BE064796 BBQS4818BE064975BE064819 BE064810 BEC64668 BE065059 




455685 


1350393.1 


BB066976 BE066928 BE066927 




455700 


1351264.1 


BBJ68115 BE068104 BE0681Q2 8E068096 BE0681Q3 BED68154 BE068198 




455708 


1352232.1 


BE069326 BE069290 BE0S9352 


60 


455732 


1353874L1 


BBD80908 BE072258 BB072190 BE072236 




455838 


1374605L1 


BE145808 BE145807 BE181883 




455935 


1384144.1 


BE158687BE158688 




455945 


1385588.1 


BE16C636 BE160606 BE160703 




456207 


165078.-1 


AA193450 


65 


456482 


1922B9.1 


AA485224 AA287308 AA25B121 




458094- 


47311.1 


ARJ86325 W72956 W73221 AA219112 




458673 


679507.1 


N99626AJ30Z701 



f 
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TABLE 19B 



5 Table 19B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 19. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 
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15 



Pkey: Unique number corresponding to an Bos probeset 

Rat Sequence source. Tl»7d^nun*OThfl]fec»lunmaraG8ntaiikld^^ 

enGM^(^se9«icec/hunaid«nnKisame22.* Dunham Let at. Nature (1999) 40M89-495. 
Strand: Indicates strand^ which «m 

NLpcsffion: tnlicatQ3nucleo^pos3iorQofpicclc^excm. 
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Pkey 


Ref 


Strand 


NLposfflon 


400555 


9801191 


Minus 


134694-134817 


400803 


9887666 


Moms 


9675647558 


400610 


9887671 


fcffinus 


117606-117928,124040-124147 


400925 


7651S21 


Plus 


3818S38391.4390O44086 


401045 


8117619 


Plus 


9004440184.91111-91345 


401049 


7232177 


Plus 


149157-150692 


401093 


8516137 


Minus 


2233533166 


401256 


9796573 


Minus 


4548245620 


401283 


9800093 


Minus 


4725647456 


401326 


9212516 


mm 


226246-227505 


401418 


7452889 


Minus 


124885-125075 


401451 


6634068 


Minus 


119926-121272 


401458 


9187886 


Plus 


76485-77597 


401497 


7381770 


Plus 


92607-92813 


401508 


7534110 


Minus ■ 


110779-110983 


401575 


7229804 


Minus 


76253-76364 


401747 


9789672 


Minus 


118S9M18816.119119419244,119609-119761.120^^ 








131258,131866431932,132451432575^133580434W 


401781 


7249190 


Minus 


83215*343533531-8365633740^8390134237^ 


401785 


7249190 


MkMS 


165776465996,166189-166314,166408-166569.1671^ 
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TABLE 20: 544 GENES UP-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT TISSUES THAT ARE 
LIKELY TO ENCODE EXTRACELLULAR OR CELL- 
SURFACE PROTEINS 
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Table 20 shows 544 genes up-regulated in breast cancer compared to normal adult tissues 
that are likely to encode extracellular or cell-surface proteins. These were selected as for 
Table 19, except that the ratio was greater than or equal to 3.0, the "average" normal adult 
tissue level was set to the 85 th percentile value amongst 144 non-malignant tissues, and the 
96 th percentile value amongst the 73 breast cancer specimens was greater than or equal 100 
units, and the predicted protein contained a structural domain that is indicative of 
extracellular localization (e.g. ig, fo3, egf, 7tm domains, signal sequences, transmembrane 
domains). The predicted protein domains are noted. 
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,T^cadherto l CaoWiiiJC_tenn r 
,SS/rra, 

JHSDF,UPARLYB, 

TM 

SS 

,SS,UDPGT 
SS,SS 
SS 
SS 

SSJMJM 

^SSPepJ^jHopepJleprolys^ 
,SSJM 



ESTs 

transforming growth factor, beta 2 
ESTs 

hypothetical proteto FU13782 
Hon»saplenscDNAaJ14035fis, done HE 
TanjetExon 

gtary57g0U1 NCLCGAPJ>r18 Homo sapiens 
ESTs 

CGV62 protein 

solute carrier fiamfly 6 ^leurotraitsmlUe 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

Homo sapiens mRNA; cDNA DKFZ0761C1712 (f 

ESTs 

ESTs 

hypomeficaJ proteh FU20706 
transcription factor AP-2 beta (ac&vml 
hypothetical protefri FU23045 
ubtnudehl 

UV-1 protein, estrogen regulated 
ESTs 

duodenal cytochrome b 



,SSJM,dis}rdEgnn^JA12B4^ ESTs 

SSJiAjnybJ>NA*ha%ig >Hnyb avian myetoWastDsb viral cmcogen 

SS ESTs 

SS£GF,tsp__3£S£2FJDP, cartilage oDgomertc mafetx protein (COM 

SS f aWeo7i > SS ) aJdedh t akfehyte Dehydrogenase 3 fenfy 
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95 
95 
9.4 
9.4 
9.4 
9.4 
93 
93 
9.1 
9.1 
9.1 
9.1 
9.1 
9.0 
83 
83 
83 
83 
83 
8.7 
8.7 
85 
8.6 
85 
85 
85 
85 
8.4 
8.4 
8.4 
83 
83 
83 
83 
83 
8^ 
B2 
82 
&2 
8.1 
8.1 

ao 

83 
85 
83 
73 
73 
73 
73 
73 
7.7 
75 
75 
75 
75 
75 
75 
75 
75 
75 
7.4 
7.4 
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73 
72 
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426451 
450701 
413519 
410555 
433m 
411558 
409079 
417275 
43Z731 
442816 
407386 
427427 
410785 
401045 

418986 
442082 
442861 
418836 
422060 
444381 
404091 
417866 



AI908165 Hs.169946 
K39960 H&288467 
AI198719 Hs.176376 
U92649 Hs.64311 
AB029496 Hs59729 
AA102670 Hs.70725 
W87707 Hs32065 
X63578 
R31178 

AK001741 Hs*739 
AFQ26942 



SS.GATA, 
,SS>IKR 
SS 

SSJt) t Sen«,SS^ema^hand 
SSJM.SSJM 
,TMM 

Hs35449 SS,efiiaixl^,etharid,ras 
H&267820 3S,fci3^1 t fn2,(n2/n1 
WD40SS 
JMJBR 



AF077345 Hs.177336 SSJecfrucSS 



AW803341 



SS 



C11001883*^q6753276|ref^PJD33938.1| c 
A1123555 HsJ1796 .SSJteproly^tspJ. 



lCEji20^SJCEj)10,lCEj)20,ICEj)10 t iCE_p 

6.7 



GATAbtodlng protein 3 (T-ceB receptor 
Homo sapiens cONAFU12280fts,ctone MA 
ESTs 

a disfntegrin and meiaDoprotelnase doma 
semaphorfn sem2 

gamnB-aminobutyncack)(GABA)AreceptD 
tnterteuta 6 signal transducer (&P130, 
paralbumin 
fSuonecfin 1 

hypometfcal proteh FU10879 

gfcHomo sapiens c(g33 mRNA, partial sequ 

ESTs 

gtxfi^-UM0079<)9030(MfiaO03 UM0079 Homo 



R41823 Hs.7413 
AA243837 Hs57787 



410Z75 
425236 
415669 
416319 
412140 
442942 
446163 
443162 
409602 
428479 
400300 
433404 
410079 
401781 
447359 
402230 
4Z7674 



ESTs 

ESTs; csJsyntenin-2 
ESTs 
ESTs 

ESTs, Modeiatetysinaarto AUJ5J1UMAN A 
ESTs, WeakfystoiBar to S64054 hypofftefl 
Target Exon 
cotagen, type XI, alpha 1 
K1AAQ575 gene product 
tjaftscnpflon factor AP-2 gamma (acfivat 
stanntocaldn 2 

serine (or cysteine) proteinase hhtbtto 
C083an^en(acfivatedB lymphocytes,! 
RAB6 Weracfing, UnesMke (rabWnes 
ESTs 

protec&t receptor 
DKFZP434G032 protein 
ESTs 

cell division cycle % 61 to S and G2 to 

,SSJHpknase£ecepJ^^ 



> TM^PH.tbo\pkinase,SAM,fr)3, 
SS 

AJ655499 Hs.161712 ,TM>UMuecp,pktnase^eafi^U5, 
R20893 H&325823 3S.TM.CD38 
BE387335 H&283T13 ^m9o_carr 
NA ^THTmU^NFjBceptor, 
AW067903 HsJ2772 SS^IIageaCOtfl.TSPN.SS.TSPN 
AL135623 Hs.193914 
U85658 Hs.61796 
AW067800 Hs.155223 
NM_0Q5Q25HsJ8589 
AI815601 Hs.79197 
AA219691 KS.736S 
A W1 67087 Hs.1315B2 
AA026880 HsJ25252 
T49951 K&9Q29 

W26713 Hs^S6972 .SSJHDAGKaflAGKc, 
Y0Q272 Hs.184572 .SS.pJtaasejMnase 
X03363 

T32982 Hs.102720 
U94362 Hs58589 



»SS^bosomaLS4e 
SS 

.SS^erph, 

SSJBUg^SJM 

.SSJonesfc 



*SSJHm3. 
fflamentSS,fianient,Qantefit 



SS 

GlycoJransL8^S 
.SS^fementpnTxsytranJflam 



419968 
416636 
419703 
424687 
449679 
4212S6 
442117 
400303 
419440 
444858 
432239 
440705 
400286 
446466 
423201 
433043 
60 439509 
425247 
409757 
425292 
448045 
452631 
452243 
439310 



NMJM2093HS.18268 
HA 

NM_003528t&2178 
AI249368 Hs.98558 
BE55Q224 Hs.74170 
XD4430 H&83913 
N32536 Hs.42645 
AI7S3257 Hs.128151 
JQ507O Hs. 151738 
AI823951 Hs.129700 
NMJXK666HS.1Q3253 
AW664964 Hs.128899 
AA242758 Hs.79136 
ABQ20689 Hs30419 
AH99738 Hsl208275 
X81334 Hs^936 
AA904244 Hs.153205 
HA 

H38026 Hs308 
NM_0Q0163Hs.125180 SSJMMSS 
W57554 Hs.125019 SS 
AF086332 HS38314 .SSJHSyntaxh 
HKL005940HS.155334 SS^Pepik^J^IO^emopexfr]^ 
NMJD01898HS.123114 ,$$£ystafin, 
NKL005824HS.155545 SS 
AJ297436 Ha2M66 £$JM 
AF153330 HS3Q246 ,SSJM 
AL355715 Hs38555 SS 
ARJ86120 Hs.102793 .saTHUDPGT^aseinJcappa 



ESTs 
gtycogemn2 
Target Exon 
adenylate kinase 5 
Target Exon 

H2B hlstone lamBy, memberQ 
ESTs 

i \t ■ U ixfKtn-ii-r Jr 4 C K«uJSmuJ\ 

rneta»tnjonein ib(uffKoonai; 
IntefteuMn 6 (Interferon, beta 2} 
sokrie carrier lamBy 16 (ronceaifeoxyfc 
ESTs 



,SSJMip45Q, 
rassone^xMUSione, 
,SSJM 
SS 

SSJL6JI 6, 
>SSJM 
,SS^C2H2, 

SSfep&bseJAWWtmopexm fiietaDoproteiiase 9 (gefertase B 

SS tolokHrtel 
SS periGpm 
,SSJM ESTs 
JSSJto UV-1 protein, estrogen regulated 

SS WAA0B82 protein 

SS ESTs, WeaWy siniterto ALUAJWMAN nu 

SS,Peptkias8jrt10,hemopexto^^ metaDoprotehase 13 (coDagenase 

TM 



SS»THABC_tran^^membfane,SS 



ESTs 

C16000922^499103|ptaJT209Q3 hypothe 

aiiestbi 3, retina) (X-arresfln) 

growth hofmone receptor 

lymphoid nuclear protein (LAF-4) mRNA 

ESTs 

inatrfacmetaAcivotatoase 11 {KWP11;sm> 



37 kDaleuine-ntt repeat (LRR) protein 



sctocarnOTtan^19(tWamlr»tjam 

programmed ceil deaft 9 

ESTs 



7.1 
7.1 
7.1 
7.1 
7J0 
7.0 
7.0 
7.0 
6.9 
63 
GZ 
6.8 
&7 



a7 

6.7 
6.6 
6.6 
6.6 
&6 
6.6 
6^ 
6^ 
6.4 
6.2 
6^ 
6^ 
6^ 
6.2 
6.1 
6.1 
6.1 
6.1 
t1 
6.1 
6.1 
6.1 
6.1 
6.1 
6.1 
6i) 
6X) 
6i) 
64 
5J 
5.8 
53 
SA 
5.7 
5.7 
5.7 
5l7 
5.6 
5.6 
5.6 
5^ 
5^ 
5\5 
5.4 
5.4 
5.4 
S4 
5.4 
&3 
53 
52 



307 



WO 02/059377 



PCT/US02/02242 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



441111 
452355 
427711 
418636 
429353 
441690 
430447 



425325 

423600 
414737 
403593 
407758 
445234 
411165 
420633 
414117 
416783 
401093 
411096 
457411 
430)07 
450506 
417975 
421072 
427032 
447752 
403199 
427122 
445900 
413048 
419563 
442432 
452093 
442323 
450606 
435542 
417576 



AI806867 Hs.126594 
N54926 H&292Q2 
M31659 Ks.180406 
AW749855 

AL117406 H&200102 
R81733 Hs33106 
W17064 Hs332848 
AI685086 H326339 
X52730 Hs.1892 
phenylethanolarrtihe N-metftyftja isfefdyi 
AK33559 Hs310359 SS 
AJ160386 Hs.125037 
NA 

D50915 H338365 
AW137636 Hs.146058 
NMJ0OO169Hs£9O89 
NM.014581H&274480 
W88559 Hs.1787 
AA2061B6 Hs.79889 

U80Q34 Hs.68563 

AW085961 Hs.130093 

A1247716 Hs^32168 
NM.004460H&418 

AA641836 H&30G85 

A1215069 HsJ9113 

AF012023 Hs.173274 

M73700 Hs.105938 S3Jransfenin t 7trrLl.transferrtn, 



ESTs 

G pratein-coupfed receptor 34 
salute carrier fiamBjr 25 (mftMhondrial 
gb*2V4ST0534-281299453<tf5 BT0534 Homo 
ATP-bWtog cassette transporter MRP8 
ESTs 

SVWSNF related, matrix associated, acb* 
ESTs, WeaWy similar to S21348 probable 



.SS.TM.PhosphodiBSt 
TfttffmJJM 
SS 

.SSJMjiECT 

.SSJM^BC.toan^BCjnernbrane, 
,SSJffiCT,zHJBR1,PABP ( 1*33, 
SS 

,SS/BS, 

5.1 
ESTs 

SS ESTs 

.QDE-N.pldnase Target Exon 

SS.SS WAA0125 gene product 

,SSJM ESTs 
SS>te&iase^nKPH f pkinase,SH2,SH3;foo fjatn rf r r ataw , alpha 

SSAttcaBrvSSJtmafr) oAtfiW«flng protein 2A 

.TMMJransJUetra, proteclpM potato 1 (PeBzaeus-Menbac 

SSJMJM rronoc^ to macrophage dffferen^ 

THLRRCTJHLRRCT, C12000586 , ^6330167|oTpAA86477.1| (A 

Pepfidase_M3, mdochondrial intermediate peptidase 

SS ESTs 

.SS^dTLzinc, ESTs 
SSJ)PPIV_K,terrrKPepaias8_S9,SS^fWJi Jbrobbstacfivatfen protein, alpha 

.SSJrypsin hypotoetfcal proteti RJ23186 

SS ESTs 

,$$,14-33 bfegm cytoplasmic cfcrnain-asscciated p 



NA 

AW057736 Ha323910 
AF070526 Hs.13429 
M93221 Hs.75182 
AA526235 Hs.193162 
BE093589 Hs38178 
AA447453 H&27860 
AW016669 K&29190 
AI6686Q5 Hs.60380 



SS.TI^ctete.cajr^.SS.TM^cte.cama NM_Q25243*".Homo sapiens solute carrier f 
r SSJ1klpttroseiRecepjLdo^ receptor tyrosine kinase {c-erb-b2, 



1 SS f Ca_channeLP» 

SS,TMJec^c^^drL3.fecfln,SS,TM 
SS 
SS 

,SSJK7tmJ. 
,SS,7HC»S,vofiage_aC 

,TMGlycojTydro_1 

AAHJ7376 H&269533 l SS I pkhaseJ*hc<3ff .&.PKSH3, 
AA339449 Hs£2285 



Homo sapiens done 24787 mRNA sequence 
mannose receptor, C type 1 
Homo sapiens cDNA FU11983fis, done HE 
hypothetical protein FU23468 
Homo sapiens mRNA; cDNA DKFZp586f$J723 (f 
ESTs 

ESTs, Moderately similar to ALU6JHJMANA 
ESTs 



445413 
424420 
432378 
452130 
434674 



421582 
410361 
426327 



452834 
427315 
446733 
442118 
421524 
453060 
453403 
444301 
453619 
432656 
426384 
431701 
416931 
420854 
418867 
443514 
447499 



AJ860Q21 H&270651 
AA151342 Hs.12677 

BB614743 Hs.146688 ,SSJMMAPEG. 
AM93046 Hs.146133 ,SSJHUDPGT 
H26735 HSJ91668 
AA831879 Hs.136985 
AB45455 Hs.78915 
AI910275 Hs.1403 
BE391804 H&626G1 
W03242 Hs.44698 
M97711 



SSJWWJM.GBP 
SS 

SS£S& 

AI638627 Hs.105685 ,SSJ)EADJon\_head 
AA179949 Hs.175563 SS 
AA863360 Hs26040 ^S.THp450, 
AA976718 H&2Q2242 ^Sema. 
AA312082 Ha.105445 
AW294Q92 H&21594 
BE466639 Hs.61779 
AKD00136 Hs.10760 
K87648 K&33922 
NM_000246Hsl3D76 
A1472078 H&303862 ,SS,ArfGap, 
AW935490 Hs.14658 .SS.BIR 
D45371 Hs.80485 
AW296927 
D31771 Hs*9404 
BE464288 Hs.141937 
AW262580 Hs.147674 



AIRS,fomTyLtranstGARS,SS t GARS^IRS^or ptosphcri)osyiglyariaiTKfe forrnyftransfer 
4*tese ESTs,Moc^rateJysimfiartoA47582B^»l 
SS,UPFC099 r SS ? UPR)099 l CGH47 protein 

prosfag&ndh E synthase 
ESTs 

,SSJMW,SH2^Hto^^ HorrosapiBrBctorePP1498imJTOOT 
.SStfsUleaceM ESTs 
pHnase,OPR. GA^hdhg potato M eu tp tf on factor, 

SS,trefb^7HtdbecepL*^ trefoil factor 1(pS2) 



SS 

,SS,ras,Y_jphosphatasej3s 
tSStHMGJboXrQament 
SSJ-RR.SS 
SS 

SSXRR, 



,SS,THPepftbseJlH t 
SS^honteotoUnrneobox 
•SSJKMIP, 

,SS,TM£ariherin^cadherin 



guanytatebtoc^protetol. Interferon 
Homo sapiens done TCCCTA00151 mRNAsequ 
gfcHuman T-cel receptor (V beta 18.1, J 
K1AA1688 protein 

HornosapleremRNA;d)rMDKFZp564N0763{f 
ESTs, WeaWy simitar to fatty arid omega 
ESTs 

GDNF femBy receptor alpha 1 
hypofceScal protein MGC15754 
Homo sapiens cDNA FU13591 fe, done PL 
asporh(LRR ctess 1) 

Homo sapiens, ctone MGC3084, ntf*NA,comp 

MHCdassDbarisacfivator 

ESTs 

Human d im inosorne 5q13.1 done 5G8 mRNA 



gbdAHBnW^(H>07^U)Ls1 NCLCGAP.Su 
msh (DrosophBa) homeo box homotog 2 
ESTs 

pjotocadhertn beta 16 



5^ 
52 
52 
52 
5.1 
5.1 
5.1 
5.1 



5.1 

5.1 

5.1 

5.0 

SS) 

A3 

43 

A3 

A3 

A3 

A3 

A3 

A3 

A3 

A3 

A3 

A3 

A3 

A3 

A3 

4.7 

4.7 

4.7 

4.6 

4.6 

4.6 

4.6 

4.6 

4.6 

4.6 

4.6 

4.5 

A3 

A3 

A3 

A3 

A3 

A3 

A3 ' 

A3 

A3 

4.4 

4.4 

4.4 

4.4 

4.4 

4.4 

4.4 

A3 

A3 

A3 

A3 

A3 

A3 

A3 

A3 

4.3 
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441560 F13386 Hs.7888 


■ t,l.. nnr. 

JXQTIaSe, 


Homo sapiens clone 23736 mRNA sequence 


43 


409064 AA062954 Hs.141883 


,SS.CUB, 


ESTs 


43 


422667 H25642 Hs.133471 


t SS,TM,FM0-tIk8 


ESTs 


43 


454032 W31790 Hs.194293 


JSSJM 


ESTs. WeaMystrnflar to 154374 gene NE2 


43 


432663 AI9B4317 Hs.122589 


TM 


ESTs 


43 


401747 


,SS,ffiamenitiarn6nt 


Homo sapiens keratin 17 (KRT17) 


43 


432882 NM_013257Hs279696 


pkEnssBtpkfnasBjC, 


senJnVgtucocofttoti 


4.2 


437036 AK71514 Hs.133022 


,SS,TM 


ESTs 


*2 


447754 AW073310 Hs.163533 


.pttnase, 


Homo sapiens cDNA FU14142 fis, done MA 


42 


443134 At954968 H&2790Q9 


,SS,TM 


matrix Gta protein 


M 


451871 AI821005 Hs.118599 


3S.GDNF, 


ESTs 


42 


457211 AW972565 Hs32399 


WH1.WH1 


ESTs, Weakly stmflar to S51797 vasooHat 


A2 


421566 NM.000399Hs.1395 


zf£2H2,SS 


eartVgn^rBspor^2(Kna-20(Drosop 


A2 


431657 AB45227 Hs.105448 


,SS,THpkinase 


ESTs, WeaMy similar to B34087 hypothefl 


4.1 


427899 AA829286 Hs332053 


l SS,SAA4XDtefrMBCjnembrane^BC„tran l S8rum amyloid A1 


4.1 


444779 A1192105 Hs.147170 


SS 


ESTs 


4.1 


442295 AI827248 H&224398 


,COLR,vwc,Co9agen, 


Homo sapiens cDNA FU1 1469 fe, done HE 


4.1 


436396 ABB3487 Hs.152213 


iWitt 


wingless-type MMTV integration site tami 


4.1 


446039 AI150491 Hs30756 


,TM,GlycoJtyrfroJ 


ESTs 


4.1 


422938 NM.001809Hs.1594 


^TM,Wobse, 


centromere protein A <17K>) 


4.1 


406922 S7Q284 


SSJM£esa1urase,SS 


g^xstearoyt-CoA desaturase [human, adfpo 


4.1 


439285 AL133916 Hs.172572 


.SS^.pktnase^RMT^RCT, 


hypdhefcalprotehFU2DQ» 


4.1 


424800 AL035588 Hs.153203 


HLH.SS 


MyoDfamfiy&ih&)9or 


4.1 


429922 Z97630 H&226117 


^,7MJmkerJtone,7trrL.1 


HI htetone femDy f member 0 


4.1 


447178 AW594641 Hs.192417 


JSSJM 


ESTs 


4.0 


409038 T97490 Ha500Q2 


SSMSS.IL8 


smaflir>duc*te(ytokiresut^^ 


4.0 


452747 BE153855 Hs.61460 


JSSjtHM 


Ig superiamBy receptor LNIR 


AS) 


420139 NM_005357Hsl95351 


l SS > 7Hp450, 


lipase, honnone^ensiSve 


4.0 


408877 AA479033 Hs.130315 


,SS,TM 


ESTs. WeaUy stmflar to A47582 B-cefl gr 


4.0 


403329 NA 


SS,SS 


Target Exon 


4.0 


439926 AW014875 Hs.137007 


SS 


ESTs 


4.0 


4308% A1073913 Hs.100688 


SS 


ESTs, WeaMy slm2ar to JEQ350 Anterior 


4.0 


432481 AW451645 Hs.151504 


,SS,CoDagen,COLFl,TSPN, 


Homo sapfens cDNA FU11973 fis, done HE 


AS) 


452410 AL133619 H&29383 


.SS.THras 


Horrosa^eroinRNA; cDNA DKFZp434E2321 (f 


4.0 


418661 NM.001949Hs.1189 


SS 


E2F transcription factor 3 


AH 


431958 X63629 H&2877 


SS,THCadhefltjCJernvafi^SS,7M^ 3, type 1, P-cadhem (placenta 


AS) 


425071 NMJ)13989Hs.154424 


SS > T4_dekxfinase ( T4_dekxftose, 


debdinase, kxktthyronbe, type n 


4.0 


447197 R36075 


JHSDF, 


grxyhB8b0i3l Soares placenta Nb2HP Homo 


4.0 


428722 U76456 Hs.190787 


,SSJtMP> 


tissue inhibitor of rnetaQoprototnas84 


3.9 


428330 L22524 H&2256 


SS^eptkiaseJtflO.SSJ^ficJase _M10,h8mopematiix metaQoproteinase 7 (MMP7; uterin 


33 


423242 AL0394Q2 Hs.125783 


SS 


DEME-6 protein 


33 


449048 Z4KJ51 Hsl22920 


SS.SSJM 


sbnSartoS66401 (cattte)ghuosemth)c 


33 


414831 M31158 Hs.77439 




pn^kirase;cAXP-o^pe«lerjtreguiato 


33 


413589 AW452631 Hs3138Q3 


.SS^bnydtolase 


ESTs, Highly simBarto AF157833 1 nond 


33 


408875 NMJH5434HS.46604 


SS 


DKFZP434B168 protein 


33 


418629 BE247550 Hs36859 


SS,SH2J ) H^S,7HPH > SH2 f FuiWIkB,pkto growftt^receptoi^ound pnrfem7 


33 


450787 AB006190 H&25475 


SS,mMff\SSJMWIP, 


aquaporh7 


33 


414870 N72264 Hs300670 


SS 


WAA1204proteto 


33 


450325 AI935962 Hs^6289 


SS 


ESTs 


33 


407633 NMJ>07069Hs371B9 


TMJM 


similar to rat HREV1 07 


33 


426172 AA371307 Hs.125056 


,SS£ENN 


ESTs 


33 


442262 BE170S51 Hs3700 


JSSJSTART t 


deleted to Over cancer 1 


33 


427961 AW233185 Hs.143134 


SS 


ESTs 


33 


445563 AW873606 Hs.149006 


.SS.WH1.WH1 


ESTs 


33 


403943 


p45O > SSj>450 


C5O)0^gIi450^225lreiINP^(X)0765.1 1 cyt 


33 


408761 AA057264 H&238936 


,SSJM.7tnU 


ESTs, WeaWy stmaar to (defOne not ava 


33 


423279 AW959861 H&290943 


SS 


ESTs 


-33 


420440 NM_002407Hs37644 


.SS.SRC^UterogJobin 


mammaglobh2 (MGB2; mammagbbtn B; Dp 


33 


445107 AI208121 Hs.147313 


,SSJM 


ESTs, WeaWy sinter to 138022 hypothefi 


17 


428303 AW974476 Hs.183601 


SS.RGS.R6S.RGS 


regulator of Gfrotatosfenatog 16 


3.7 


411667 BE160198 


TM 


gbXJV1-mW13^020M5Wi03 HT0413 Homo 


37 


427809 M26380 Ks.180878 


,SSJpasePUT,Sec7,PH, 


flpoproteto Bpftw 


3.7 


418203 X54942 Hs3375B 


CKS.SS,CKS, 


CDraproteto kinase 2 


37 


430376 AW292053 Hs.12532 


SS 


dvornosorne 1 open reading frame 21 


3.7 


444190 AI87B918 Hs.10526 


SS 


(7Ste*ieandg]ycWfichproteBi2 


3.7 


433495 AW373784 H&71 


SS.Ig^lHCJ^nnexin.SCAN.SS.TM 


aJpha^-gAcoproteln 1, zfoc 


3.7 


429638 AI916662 Hs211577 


SSJMlSS 


(dnecflnlfkhesin receptor) 


3.7 
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454071 
451859 
420281 
427691 
428824 
424676 
418026 
457465 
417601 
407999 
425548 
446619 
411213 
408625 
417511 



407137 
401866 
451195 
426044 
426310 
440029 
408573 
431830 
444781 
431493 
414175 
411789 
418851 



407104 
449051 
434398 
454042 
459496 
414993 
407756 
442101 
449722 
452554 
421991 
420058 
425776 
407846 



445873 
418054 
414921 
452268 
428862 
412520 
410530 
401780 
447131 
418334 
415138 
421168 
431473 
421379 
411984 
408101 
405366 
414612 
411393 
435767 
416406 
433068 



A1041793 Hs.42502 
H44491 H&2S2938 
A1623693 Hs.191533 
AW19442B H&20726 
W23624 H&173059 
Y08565 Hs.151678 
BE379727 H&83213 
AW301344 Hs.122908 
WL014735HS-82292 
AJ126271 . Ha.49433 
AA890Q23 Hs.1906 
A U076643 H&313 
AA67B939 HsiJ9285 
Y13647 Hs.119597 
AL049176 rteB2223 
AW207175 Hs.106771 
T97307 

U10492 Hs.438 
AA5Q2490 H&336695 
NM.000909H&169266 
AW089705 Hs293711 
AA284775 Hs.43148 
Y16645 H&271387 
NM0144OOHS.1195O 
AT791493 Hs.129873 
AQ08876 Hs.103849 
AF245506 Hs72157 
AM17828 Hs.182435 
AA847843 Hs£2711 
S57296 Hs323910 
AW961400 H&333526 
AA121098 Hs3838 
H22570 Ks.172572 
AA808940 Hs274450 
NM.002543H577729 
AA116Q21 HS38260 
AI651930 Hs.135684 
BE28O074 Hsl23960 
AW452434 H&58006 
NMJH4918HS.110488 
AWKH423 Hs34694 
U25128 Hs.159499 
AA426202 H&40403 
134041 Hafl739 
AA250970 H&251946 
NM_002318Hs33354 
BE390551 Hs.77628 
NML0Q3512H&28777 
NM.000346H&2316 
AA442324 Hs.795 
M25809 Ha.64173 

NM004585HS.17466 
AA319233 HsJ5521 
C18356 H&S5944 
AF182277 Hs330780 
AA825686 Hs321176 
Y15221 Hs.103982 
NMJD05419H&72988 
AW968504 Hs.123073 

BE274552 HsJ6578 
AW797437 HSJ69771 
H73505 Hs.117874 
D86961 Hs.79299 
NMJ»6456H3288215 



JM,7tmJ, 

.SaTM^JdUecepLsOdUecepUE^ 
>SS,AAA, 

,SS,GrycosJransL2. 
SS 

GlycoOanst2^icin3JediaSS > G^ 
Bpocafin,SS > fipoca{kiJlpoc^iivfefrffl 



PTO.pHnase,SS 

SS 

SSJWn3,SSJKifn3, 
.SSJtyeflwdMJrars 

S$,mCUBJ3J*Jypej;,MA^ 1 
SSJWtesaturase.SS 
SS 

,SS,7tmJ.SPRY l 
^S,1HGDA1_C039 
.SS.fOament, 

SSJttffleoboxJ Et s,SSthomeobox, 
SS 

SS,TM,7tmJ, 
SS 

3S.7MPMP2^Ctaudin, 

,SSj*JadamaseJ3 
,SS,p450j>450 



ESTs 

ESTs, Weakly similar to ALU1 JHUMAN ALU S 

ESTs 

ESTs 

ESTs 

UDP4^acergn^a!pha^>flalac^^ 
tatty add binding protein 4, adipocyte 
DNArepDcafonfector 
WAA0215 gene product 
ESTs, Weakry similar to YZ28JJUMAN HYPOT* 
prctecfin receptor 

secreted phosphoprotetn 1 (ostecpontin. 



chonfirHDoe 
ESTs 

gfeye53h05.s1 Soaies fetal Over spleen 
TargetExon 

mesenchyme homeo boot 1 
ESTs 

neuropeptide Y receptor Y1 
ESTs, Weakly stater to S64329 probable 
ESTs 

smal forfuc&te cybMhe suWajnfly A (Cy 
GPI-anchored metastasis-associated prote 
ESTs, Weakly sftnBarto A36036 cytochrom 
JMJrawipe»nJ > ept^^ 

tgJJ^RCT, DKFZP564I1922 protein 

JSS.1M ESTs 

,SS.HMG_box, Homo sapiens, done IMAGE3351295, mRNA 

,SS,Tr4SH2Wpkmase£e<^J^&)^ avian efymrobfasfic leukemia v 



SS 



pkinase.POLO Jxa^, pJdnase,POLOJxK, 

l SSJMpKRAB f SCAN^C2H2Jg 
SS.TM 

SS,UCH-1,UCHZSS,mGjhUranspept 
SS 

cycfriSSJMiyc&n, 
SS 
SS 
SS 

SSJM,7tou2.SSJMJtmJ* 
/Tfc/^ABCLjneiTibfB)0^BQJianJ?3)osoniaL! 
,SS JM,tanspwU)ro^ afcrrydrogenase 1 

^s}rmPABP > pWnase,14^3 t rnn poly(A>toDng protein, cytoplasmic 1-1 

^^TNUiritDjcan^LysyLcudase rysytaxfdase-flte 2 

SS^n*ART^S,STARfT>iNMTJ , NMT_TEMT, steroidogenic acute regulatory protein r 
SS^fetore^^CGRP^lAPP^^WC^SPRY^A hfetone family, member L 



HER2 receptor tyrosine Idnase (c-erb42, 
senim-tnducible kinase 
rrypofteik^prctetoRJ20093 
EST 

ocddbed low density lipoprotein (tecfin 
ubfynfin specific protease 18 
ESTs 

cytfinBI > 

ESTs. Weakly slmtoto ALU5JUIMAN ALU S 
WAA0990 protein 

HomosapiertscONAFU10S61 fis^dooe WT 
paiaflryicJdtomtone receptor 2 

transadlvator, wft 



SS^HMGJxo, 
hstort8,SS r histDne # BoIA 
A7P-«ynLab,SS,7fjn_t^TP^ynLab 
ftamentfSS^amerrt^nlarnent 
TM 

^TW&o$ornaLLZ7e ( 

.Kunffc^BPTT. 

SS4>450SS 

SS 

SS4L8, 

SH2£TAT,SS,STAT 

,,,iit 1 

WwGEF.PH.SS^hoGEFPH, 

SAP^SWSAP.vwa 

SS^ustytnypsfewamr^ 

^S^epWase_S8P,Pepfioase_S8,P 

.SS.TM 

,SS r Prit»syttran, 
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SRY (sex detennWng region Y)-bcx 9 (ca 
H2A hfetone family, member O 
ATPase. H trareporfing, rysosomai (vacuo 
NM_005K7**tomo sapiens keratin 16 (toca 
retinofc add receptor responder (tazaro 
ESTs 

Sssue factor paftway Inhibitor 2 
cytochrome P450, subfamily IB (phenobar 
ESTs, Weakly simter to S65824 reverse t 
smal inducible cytokine subfamBy B (Cy 
skp^tiansotoBr and activator of trans 
CDC2-reteied protein kinase 7 
NMJXB371*tHomo sapiens vav 2 oncogene ( 
proteki tohtoitor of activated STAT3 
B4actor, properdin 
ESTs 

Bpoma HM&C fusion partner-like 2 
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445462 AA378776 Hs288649 SS.SS hypoftefcal protein MGC3077 33 

439452 AA918317 rte^7987 SS.SS B^Cllflymphoma11B(zfric finger pro 33 

452017 AF1083Q2 Hs27495 SS prostate cancer associated protein 7 33 

403099 AKD00725 Hs£Q579 SS typothefcai protein RJ20718 3.3 

5 452106 AI14KB1 Hs21342 SS ESTs 33 

447519 U4S25B Hs339665 SS ESTs 33 

426928 AF037082 Hs.172914 ,SS^M»^T^^T0 : bjHopepBd8 refinolOBhydiDgenas85(11^and9Ki 33 

438825 BE327427 Hs.79953 .SSJMJfetorK^jeceptQf^^ ESTs 33 

414575 H11257 Hs22968 ^SSjjJdnaseJg, Homo sapiens done IMAGE451939, mRNA se 33 

10 417837 AL0799O5 Ks.1103 SS,TGFb_propepfltte I TGF-beia t SS transfom^ gro*tt factor, beia 1 33 

42213 AW881145 SS gbK3VT>On)033^0400.182-a07 OT0D33 Homo 3.3 

445941 AI257371 Hs.172636 SS ( SS r fipoxygenase^UT ESTs 33 

42S973 AM23317 Hs.164680 .SS.T-to.UDPGT ESTs 33 

444542 AI161293 Hs280380 SS,SSPep&tese^M1^GFJgJec&x.c^BW amhopepftasd 33 

15 459561 AI547306 Hs.134981 SS ESTs 33 

425741 AF052152 Hs.159412 .pkinase, Homo sapiens done 24628 mRNA sequence 33 

426501 AW043782 Hs293616 SS ESTs 33 

456508 AA5QZ764 Hs.123469 SS ESTs, Weakly similar to AF208855 1 BM-01 33 

434228 Z42047 Hs283978 jSSJMTtaU Homo sapiens PROZ751 mRNA, compfetecds 33 

20 415752 BE314524 Hs.78776 TM putative barisn«mbrarepro^ 33 

400419 APD84545 .SS^PepBdaseJAI, Taipei 33 

439750 AI359053 H&57664 . ,mirtea7iiv3 r Rk^B Jedh^ Harx) sapiens mRNA fciSler^ Insert cDN 33 

423858 AL137326 Hs.133483 ,SSJM Hoiro sapiens rrtRNA;cDNADKFZp434B0K0(f 33 

428514 AW236861 Hs.193139 ,SS,START P NNMLPNMLTEMT I ESTs 33 

25 428698 AAB52773 Hs334838 SS WAA1866 protein 33 

448988 Y09763 H&22785 SS/TMJM garnma-ajranoburynfca^ 33 

432072 N62937 Hs269109 ,SemaJg, ESTs 33 

417433 BE27Q266 Hs3212B SS,TMJLRRCTJJ?RNT4J^THI-RRCT» 5T4 oncofetal trophc^tastglycoproteh 33 

452194 AI694413 Hs332649 JSSJWtaLSJWjecefajs^ oflactofyrecepiDf.far^2,subfam^ 32 

30 444051 N48373 Hs.10247 ,SSJ|g, acfirated leucocyte ceB adhesion mofecu 32 

420042 AW01514O Hs.161723 .SS.CUB, . ESTs 32 

457292 AQ21270 Hs334882 SS,TM r SS,TM,Gi5afch hypoftiefica) protein FU14251 32 

421458 NKL003654H&104576 SS carbohydrate (teratan sulfate GaW) sui 32 

431104 AW970859 Hs313503 .SemEug, ESTs 32 

35 443767 BE562136 H&9736 t SS,PCUlasGEF > h(mr»najec l 2^C4 r protBasont8(prosomej,inaciopaln)26S 32 

419589 AW973708 HS2D1925 »FGF, HcmsapierscDNARJ1344663 t donePL 32 

415447 297171 Hs.78454 SS,Oi r ,OU : »CXF ( Rax«mLL4 rnyocDm, trabecular meshwonttndudbie 32 

443464 BE548446 Ha5167 SSJHSSF.SSJM Homo sapiens mRNA; cDNA DKFZp434F152 (fr 32 

423431 AA326062 ,$$ J p450p450 gb£ST29171 CereWumBHwro sapiens c 32 

40 413278 BE563085 Hs333 ^TM.i^o^^ainHruGM^l^GFJc interfercfK^lated pro 32 

458451 AW297181 Hs.195922 v SSJVbosomaLl14 ESTs 32 

440449 AA885430 Hs201925 fGF. Homo sapiens CDMFU13446 fleeter© a 32 

413753 U17760 Hs.75517 SS,lanitoL^^tlraUtten^ lan^beta3^lceln(125U}Xkalinm 32 

434876 AF160477 Hs£1460 ,SS,HLH tgsuperfacnBy receptor LNIR 32 

45 435575 AF213457 Hs.44234 SSJg,SS,TM rjlggerlrcreoaptor expressed on my^ 32 

415773 R21651 Hs324725 ^THRi>C5ornaLS3AB,6^amma ESTs,ModeiatelysirnfertoA47582&<jel 32 

446440 AV658411 Hs.42656 SS WAA1681 protein 32 

450847 NHJD03155HS25590 £S,r»meobox, stanriocafcto 1 32 

426075 AW513691 Hs270149 ^S^ ESTs, Weakly sMar to 210926QAB eel 32 

50 452110 T47667 Hs28005 3S,TM > Acfivtruecp l p«nase Konx sapiens cDNAaJ11309fis t ctone PL 32 

439963 AW247529 H&6793 JMip450 ( Qs ptatetetacfoafing tactoacetyfoydroia 32 

402837 NA SS ENSP0C00Q241312 t t£Xm7L8.13 (novel Sush 32 

439451 AF086Z70 Hs278554 ^.Chrorro^hao^^irorno, h etero c hi t ma Htepnrteinl 3.1 

406664 L34041 Hs4739 .SS/lXtanspwUtfo^ (tehydrogenase 1 (so 3.1 

55 417315 AKS0042 Hs336901 ,SS l RNAj>oUU?NAj>oU^^ 3.1 

413011 AW068115 Hs321 SSJJ*RXJ^.SS4JWOT,LRR. Ng^can 3.1 

414987 AA524394 Hs294022 ,SS l nrax*U»nT^ji^ hypofteflcaJ protein RJ14950 3.1 

429197 H24471 Hs26930 ,SS,Getso&i. ESTs, WeaJdys!mBartoT2Q272hypotje8 3.1 

: 448030 N30714 Hs325960 .SSJM (neirtirane^paraAig 4-dornates, subfamily A 3.1 

60 407604 AW191962 Hs24S239 £S,TKC1q, cofegen,&pe VOL alpha 2 3.1 

419092 J05581 Hs39603 SSJM.SEA, mucbl.trarcanerrtjrane 3.1 

456672 AK0Q2O16 Hs.114727 ,SSWK Horro sapter^ done M6G16327, mRNA, com 3.1 

443171 BE2B1128 Hs3030 SSJH7tmJ,nmSS TONDU 3.1 $ 

452256 AKDQQ933 Hs28661 .TT^GDlJtrnJ. Hon»sapfenscDNAFU10071 fis,doneHE 3.1 

65 432201 AI538613 Hs298241 SS f mtn^sm^TMM>94ryps^ TiansnienAfane protease, senne 3 3.1 

406642 AJ245210 SS gbi^ sapiens mRrMfor*^^ 3.1 

400903 NA SS Target Exon 3.1 
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434408 AI031771 Hs.132566 .SS^JrydroJ ESTs 3.1 

452994 AW962597 Hs31305 SS f WD40,SS,WD40. WAA1547 protein 3.1 

445903 AB474S7 Hs.132781 fo3,SSJHEF1BD class I cytoUne receptor 3.1 

424364 AW383225 fe201189 SS ESTs, WeaMy similar to G01763 atnophhv 3.1 

5 410196 A1936442 H&59838 UBACTjepeat^UBACTjepea^ 3.1 

419150 129618 Hs£9640 JHoHraseM TEKtyrosfrie kinase endoSieOa) (venous 3.1 

433417 AA587773 H&8859 ,SS,SRCR, Homo sapiens, Sinter to RKBJcDNA 5830 3.1 

418624 AT734Q80 Hs.104211 JSewaJg, ESTs 3.1 

436291 BE568452 Hs*101 .SS^bhydrolase. protein regulator of cytoktefe 1 3.1 

10 411000 N40449 H&201619 SS ESTs. WeaWydm8artoS38383SEB4B pro 3.1 

412519 AA196241 Hs.73980 SSJroporonJJerra^ troponin Tl.sketetal, stow 3.1 

450223 AA418204 Hs341493 .SS.proJsomerase, naluraJIdler^unwrecop^toseoA^ 3.1 

422790 AA809S75 Ha25933 JM^stone,Sec1Jii^^ ESTs 3.1 

424269 AW137691 Hs.199754 ,SS,TM,7dtl2,GPS ESTs 3.1 

IS 435854 AJ278120 Hs.4996 .SS.WD40 rjufeteardcyrin-repetf ^ 11 

447388 AW630534 Hs.76277 ,SS,TMjrn\oxkk)n3d_o6/aktored^ Homo sapiens, done MGC5381, raRNA, comp 3.1 

451631 R0Q866 SS gbcye79c0Zs1 Soares fetal Bverspteen 3.0 

448105 AW591433 Hs*298241 l SS > TMire^tryps{n t trefoD Trartsn^mbrane protease, serine 3 3D 

438637 BB00941 Hs.126730 JIAPK ESTs, Weatty sirrta to K1AA1214 protein 3D 

20 423024 AA593731 Hs325823 ,SSJM,CD36 ESTs, Moo^ralelysimDartoALU5_HUMANA 3D 

456592 R91600 ,SS^n^,UMJ^L3P1.^P,TPfUJM gfcyq1CoQ2Jl Soares fetal Bver spteen 3.0 

425920 AL049977 Hs.162209 SSJHSSJMfM^O^ 3.0 

444670 H58373 H&332938 ,SSJM rrypotefcal protein MGC5370 3.0 

401197 artart. ENSPO000Q229263*tiSPC213. 3.0 

25 437755 AW204256 Hs^91887 ,wnt ESTs 3.0 

452560 BE077084 Hs336432 ,SS^muH^BP^Hnas8,C24)krnase_C,r^ ESTs 10 

410274 AA381807 Hs.61762 SS.SS hypoda-Mucible protefo 2 10 

450098 W27249 Hs.8109 SS hypoftefca! proteh FU21080 10 

404825 ,SSJM TaigetExon 10 

30 458389 H70284 Hs.160152 ,SS*A ESTs,WeaJdys&rritotoFPmalpha-fetDp 10 

408196 AL034548 Hs.43627 HMGJ»X4)Kirmse^CCHC^TM^M6Jxix,SRY {sex o^errnhing region Y>*qx 22 10 

433675 AW977653 Hs.75319 .SS.rfeoniKLrecLsm, ribonudeofide reductase M2 polypeptide 3.0 

418848 AIB20961 Hs.193465 4ea1rvZU5^kinaseAdhm ESTs 10 

422095 A1868872 Hs£82804 SS,Cu-oxi&ise^CtHHfa^ hypo&eBcal protein FU22704 10 

35 415992 C05837 Hs.145807 .SSJM hypofheScaJ prate* FU13593 3D 

424631 AA688021 H&179808 SS ESTs 10 

409956 AW1 03384 Hs.727 SS,T(^^T(^j)iopBp8(te,SS,TGF^Irtf^ beta A (acfivh A, acfrtn AB a 3.0 

419667 AU077005 H&9220B SS^eirteg^RepfolysiaPep_M12B _prope a tfisWegiin and rnetaBoproteinase doma 10 

450946 AA374569 Hs.127698 SS ESTs, Moo«atelysirnSarto21(B2KlABc 10 

40 447770 AB032417 Hs.19545 Frizzted/z^S.THFrizzted.Fz. mzzled(Dn)St)phBa)homok]g4 10 

423826 U20325 Hs.1707 SS.SS cocare-aridarnphetanii^^ 10 

418838 AW385224 Hs35198 ,SS.THPhosphocfest ectonudeotopyroph^^ 3D 

442804 AW300118 Ks.131257 ,SS.TM,G^arnrna ESTs 3D 

432284 AA532807 Hs.105822 ,SSJHpkinase, ESTs 3D 
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TABLE 20A 



Table 20A shows the accession numbers for those pkeys lacking unigenelD's for Table 20. 
5 For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

10 

Ptcey: Unique Eos probestJi frteiiEJkir number ' 

CAT number. Gene duster number 
Accession: Genbank accession mimbeis 

15 



Pkey 


CAT number 


Accessions 


410785 


1221055J 


AW803341 AW803265AW8Q34WAW803466AW803^ 


411667 


1253334J 


BE160198 AW935898 T11520 AW935930 AW856073 AW861034 


418636 


177402J 


AW749&55AA225995AW750208AW750206 


420854 


197072J 


AW296927 AI684514 AE63168 AA281079 


42212S 


211994.1 


AW881145 AA490718 M85637 AA304575TD6067 AA331991 


423431 


22816^1 


AA326062 AA325758 AW962182 


423345 


233566J 


AM10943AW948953AA334202AA332882 


426878 


273265J 


BE069341 AW748403 AUM4891 AI908240 AA393080 


433687 


373061.1 


AA743991 AA6048S2 AW272737 


447197 


711623J 


R36075AB66546 R36167 


451631 


878098.1 


R00866R01523AJ806815 


456207 


165078.-1 


AA193450 


456582 


2Q2684J 


R91600TB7079AA291455 
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TABLE 20B 



Table 20B shows the genomic positioning for those pkeys lacking unigene ID's and 
5 accession numbers in Table 20. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Ptey. Unk^tmmberconespondhg to an Eos probeset 

10 Ret Sequence sours. T1» 7 (figlnuntos in ^ 

entffledTheDrtt sequence of toinro chromosome 22/ DuiihamLeta^ Nature (1999) 402:489-495. 
Strand: I n de ate s DMA strand from which exons were predicted. 

NLposftfon: Incficstes nucteoftte poslfimts of predicted exons. 



15 



20 



25 



30 



35 



40 



Pkey Ref 

400608 9887666 

400903 2911732 

401045 8117619 

401093 8516137 

401197 9719705 

401747 9789672 



401780 
401781 
401866 
402230 
402408 
402578 
402606 
402837 
403199 
403329 



7249190 
7249190 
8018106 
9966312 
9796239 



9369121 
9958183 
8516120 



403943 
404091 
404347 
404626 
405366 
405654 



7711864 
7684554 
9838195 
6572184 
2182280 
4895155 



Strand Nlposffion 



Minus 

Plus 

Plus 

Minus 

Phis 

Minus 

Minus 

Minus 

Pbs 

fcftius 

Minus 

Plus 

Minus 

Minus 

ftffinus 

Plus 

Minus 

Plus 

Minus 

Plus 

Plus 

Plus 

Minus 



96756-97558 
5911239228 

90044-90184,91111-91345 

2233523166 

176341-176452 

118598-118816.119119-119244^1960*1^ 

131932,132451-132575.133560-13401 1 

28397-28617.2892()-290453135-29296;29411-295^ 

8321&^3435^1-8365633740^01 t 84237-84393^95&^37,86290^M 

73126-73623 

29782-29932 

110326-110491 

6635056496 

6174742094 

2013^186,9570^58,1113&.1imi9429-19677,21210-^ 



62S54*6Z71£6944&£9602 

100742-100904,101322-101503 

82121-83229 

74493-74829 

47726-48046 

2247842632 

53624-53759 
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TABLE 21: 210 GENES UP-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT TISSUES THAT ARE 
5 LIKELY TO ENCODE EITHER ENZYMES OR PROTEINS 

AMENABLE TO MODULATION BY SMALL MOLECULES 

Table 21 shows 210 genes up-regulated in breast cancer compared to normal adult tissues 
that are likely to encode either enzymes or proteins amenable to modulation by small 

1 0 molecules. These were selected as for Table 19, except that the ratio was greater than or equal 
to 3.0, the "average" normal adult tissue level was set to the 85* percentile value amongst 
144 non-malignant tissues, and the 96 th percentile value amongst 73 breast cancers was 
greater than or equal 80 units, and the predicted protein contained a structural domain that is 
indicative of enzymatic function or of being modulatable by small molecules (e.g. pkinase, 

1 5 peptidase, phosphatase, ATPase, or ion transporter domains). The predicted protein domains 
are noted. 



Pkey: Un^ Eos probeset Identifier numbef 

20 ExAocre Exemplar Accession number, Genbank accession number 

Un&ene©: IWgene number 

UnfeeneTOe: Unfeenegenaffite 

R1: Ratio d 93rd percentile tumor to 85* pe^^ 



25 



30 



35 



40 



402578 
425692 
424001 
45 418007 
421727 



50 



55 



Pkey ExAccn UnJgenelD Predicted Protein Domains 



449746 
400292 
424735 
407178 
408045 
450375 
429170 
445730 
424634 
420757 
424399 
447350 
456207 



443348 
424088 
400295 
408771 
421155 
424905 
438167 
459583 
423945 
445263 



AJ668594 Hs.176588 
AA250737 Hs.72472 
U31875 H&272499 
AA195651 Hs.104106 
AW138959 H&245123 
AA009647 Hsl8850 
N*L001394>feL2359 
AI624342 Hs.170042 
NM.0O3613HS. 151407 
X78592 Hs39915 
AI905687 Hs2533 
AB75572 Hs.172634 
AA193450 

X52509 Hs.161640 

D90041 Hs.155956 
W67883 Hs. 137476 
M13509 Hs33169 
Y13153 Hs.107318 
W20Q27 Hs^3439 
XD7820 Hs2258 
AW873596 Hs.182278 
AB51010 Hs.102267 
W72838 H&2533 
AW7325 73 Hs.47584 
H87879 Hs.102267 
NM.002497HS.153704 
R28363 Hik24286 
AI907673 
AA41Q943 

H57646 Hs.42586 



SS,p450 

oWih£U5J*lAcflviu^ 
SSJM 

SS^thydroorolase, 
PhosphoaOestSc^tomedfruB, 

$SJft*o1sMegritf>epJM^ adtsfrrtegifoandmeta!k)prt 



UnlgeneTftJe 

ESTs, Weakly similar to CP4Y J1UMAN CYTOC 
BMP-RIB 

shork^alccnolcfehydrogenasetai^ 
ESTs 
ESTs 



DSPcJtacdanese, 
SSJM,Cafior\_ef8ux 
BJspJ^SMA 

hcnricne <- rec f AnoVDgerLjecep^f-C4, 
SS 



SSJHp450.p450 



dual specificity phosphatase 4 
ESTs 

carBage htermedlaie layer pfoteti nu 
androgen racfcptuf (dhydrriestosterone r 
atdeb^dehydrogerase9£^ 
ESTs 

gb2rt0e07j1 SoaresJflirM^SI Homosapi 



SSJM^mirolrarU^^ aminotransferase 

SS,p450^S,THp450 C1001134gfJ2117372|pirl|[K981 fafiy ac 

SS,Acetyftransf2, li^ltransferase 1 (ajyiamheh^acety 

pfcfnase, paternally expressed 10 

SSJwropexirtfep^ metaBoprotefftase 1 (KWP1; Inters 

kyruirentrm3-mcwuxayger^0cy^^ 

SS,Pep8dase_M1 l ESTs 

temop<ahPeptkJa^ melaDorjrotehase 10 (MNff>10;str 

SS.DENN calmodufc2(phospta^ 

S^LysyLcotdase iysylcattase 

SS alcihyte dehydrogenase 9 f^.mer^ 

BUOetn^lonJrar^ potasswnv^ge^alBdcharftei delayed 

SS4.ysyLoxfdaseJ\^^ lysyiaxfclase 

pWnase^THpltoas^ NBbIA (never In iratosts gene a)^elatedk 

SSJM7trnJ.p450,nm ESTs 

pWnase, gML-BT152-08Q3994JQ4 BT152 Homo sapfen 

c^ath7U5 > TM^^_n3cp,pWnase, gbczB2hQ3.r1 Soaies ovary tumor NbHOT K 

SS.AcySransferase, K2AA1560 protefrt 
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R1 

65.7 
553 
533 
39.3 
343 
25.7 
243 
24.1 
21.7 
21J 
203 

m 

183 
18.1 
173 
16.7 

15J 
153 
133 
133 
133 
123 
127 
113 
113 
11.7 
115 
113 
114 

m 
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42J887 AL080207 Hs.134585 SS;THBRCT v ank^Jran^ABCJran DKFZP434G232protKn 109 

429859 NM_007050ffe.225952 SS.THYj)hosphatese^AMM prated tyrosine phosphatase, receptor t 10.4 

428368 BE440042 Hs33326 SSJPepttiasejM^ 103 

418912 NM_000685Hs39472 SSJMl7trtLlSSm7tnU angloten^n receptor 1 103 

5 451952 AL120173 H&301663 . SS,pkhase t ESTs 103 

402408 NA SS^jinrtydtese NMJB0920*flomo sapiens hypothetical pro 93 

415539 AI733881 H&72472 deidh^THAcfivb^^ BMP-R1B 9.4 

406687 M31126 H&272620 SSPepfio^J^IOMnope^^Pepteejratrix metaflopnrtefnase 11 (MMP11; stro 9.1 

419948 AB041Q35 H&S3847 Fenkueduct.TM^efripjeo^d, NMJH6931:Homo sapiens NADPH oxidase 4 ( 9.1 

10 400285 NA TM^_tran/iBC_rnente)e, Eos Control 83 

408380 AF123050 Hs.44532 SS.TM^Aj^/friua/NFjeceptof^ oWquttt 83 

409203 AA780473 H&387 SSj)450 i SS > p450 cytochrome P450, subfern3y IVB, poiypept 83 

424902 WL0Q3866Hs.153687 SS,SS Inositol poJyphosftfraJe^osphalase, 83 

431725 XG5724 H&2839 S&CysJcnotSS Nonie disease (pseudopjioma) 83 

IS 418092 R45154 Hs.106604 o^ZU5,p»dnase>^ctMn^^ ESTs 83 

427811 M81057 Hs.180884 SS^carbOpepU>ropepJ^ (fissue) B2 

423554 M90516 Hs.1674 GATase_£SIS, ghrtarnhe4rudDse^iih()sphate 8.1 

428261 AW242243 Hs.168670 SS,THWD40,ubkiuain£1-E^ protein 73 

413374 NM_0OtQ34HsJ5319 SS rfeonucfeoto reductase 73 

20 432677 NMJ»4482Hs378B11 SSJHGtycosJrar^2Jfc^^ UDP-r±*cetykir^^ 73 

456986 D38299 Hs.170917 SS,7M.7trn_1, prostaglandin E receptor 3 (subtype EP3) 73 

453331 AE40665 Hs3895 SS,7Htfs!ntegrto^Jfl2^ ESTs 73 

407721 Y1Z735 Hs38018 pSdnase, Aiakjpec^tyrostoe^Y^^ 72 

418004 U37519 Hs37539 SS f atiedh > SS f a)dedh, aldehyde dehydrogenase 3 fem3y, member 72 

25 410555 U92649 Hs34311 THd^tegrh)*epro}ysia adstotegrhandrretafl^ 7.1 

443695 AW204099 Hs337720 ESTs, Weakly s)m3ar to AF12G780 1 retin 63 

423545 AP000692 Hs.129781 GAF,PDEase chromosome 21 open readtag frame 5 63 

401045 ICE^^SJC^IOJC^yCEjIO^C^p 

C11001883*gi|6753278|iefjNP 03393&.1|c 6.7 

30 442082 R41823 Hs.7413 TM^ ) H_lbd 1 pJdnase T SAM t &i3. ESTs; cafeym*nlfc2 67 

418836 AI655499 Hs.161712 TM^dMn_rec^^ ESTs 63 

404091 NA TM,7tiu3 f ANF>ceptor l TargetExon 63 

450865 AI248013 Ks.106532 zH2H2 ESTs, WeaWy similar to 138588 reverse t 63 

424085 NMJDQ2914HS.139226 SSAAA.Vira|J)eIkase1 > nm, repGca5on^dorC(ac&vator1}2(40 63 

35 449650 APD55575 H&23838 TMiort_trans,SS,TT^Wj^ans, cafctum channel voltage-dependent, Lty 6.4 

432304 AA932186 Ks.69297 TM,7tmJ, ESTs 6.2 

415669 NM.005025H3J8589 SS^etpm, serine (or cysteine) proteinase inhfofto * 6.2 

442942 AW167087 Hs.131562 SSjg^Serna^ktoase, ESTs 6.2 

428795 R45503 Hs37469 SSJM ESTs,H)ghVskTtotoA39769 r4^cetyfl 6.1 

40 428479 Y0Q272 Hs.184572 SS.pldnasejiktnase eel Division cyde 2, G1 to Sand G2 to 6.1 

400300 XD3363 SSJMjfctoaseJteepJ^don^ 6.1 

447359 NMJD12093HS.18268 SS^rtJTP/tateHnase, adenylate Hnase 5 6.1 

402230 NA SSJHP45Q, TajgetExon 6.1 

424687 J05070 Hs.151738 SSPepHasejmtn2 ( her^ 53 

45 432328 A1572739 Hs.195471 6PF2KJ 3 GAM, &*hosphofrudo^-Jd^^ 53 

432239 XB1334 Hsw2936 SSJ 3 epftteseJ^0 t hemopexh t S$Pe^ rnetofloprotehase 13 (coteoenase 53 

400286 NA SSJM^Jran^jnemta^ C160O)9225ll74»103lpfrffT20903 hypothe 53 

43247 NMJ005940HS.155324 SSPepfio^J^O^riopexKSS matrtein8!aioprc4B^110«^1;sfro 5.4 

434737 AAB28248 Hs^91884 UCH-l.pttias^OPIWxx^^ 5.4 

50 439310 AP086120 Hs.102793 SS.THUDPGT.caseftUtappa ESTs 52 

441111 AI606867 Hs.126594 SS.THPrwsprodiest, ESTs 5.2 

452355 N54926 H&29202 TM,7tm_1 f TM G protetrKXJUpfed receptor 34 S2 

427711 M31659 Hs.180408 SS sotutocan1erfam8y25(mitochaK^ 52 

429353 AL117406 Hk2C01Q2 SS,TM^ Jran^ jrerrfcrane, ATP-bfrxflng cassette transporter MRP8 5.1 . 

55 425325 X52730 fe.1892 ' SS.NNMTJ^NKajBfl'.SS^ 

phenyteftianoian^ne^nemy&ranslBrase 5.1 

448706 AW291095 H&21814 SSJMj*hase, Marietta 20 receptor, alpha 5.1 

403593 NA - CID&KpWnase Target Exon 5.1 

432777 AA564991 H&269477 alpha^rnytase, ESTs 53 

60 446232 A1281848 Hs.194691 SS.1H7lnvJJU)osc«naU13 letfoofc add Induced 3 43 

411165 NM_aXM69Hs.69089 SSJtefoteatfTKJ^^ 43 

414117 W88559 Hs.1787 TMJonJransXJetra, prcrteoBpUprote^l (Pefizaeus^^ 43 

411096 U80D34 Hs.68583 PepfidaseJO. niftochoKirtalb!^^ 43 f 

450506 NMJ)04460Hs.418 SS^ryj^_^rmPepfidase^S9.SSJDPP/V _N_0robtet acflvafion protein, alpha 43 

65 417975 AA641838 Ks30085 SS.trypsfn hypome^ protein FU23186 43 

447752 M73700 Hs.105938 SS,traftferr^7^ iactotransferrm 43 

427122 AW057738 Hs323910 SSJM^Hnasitf^^ 43 
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400181 MA SSJR3Beta_HSD. ENSPC00C0171555.-CDNA FU10727 fis, done 4.6 

452(03 AA447453 H&27860 SS.WImJ, Honro sapiens mRNA;cOMADKFZp^6ftOT723(f 4.6 

435542 AA887376 Hs£69533 SS>pIitoaseJ*rto(» l lgPriSH3, ESTs 4.6 

417576 AA339449 Hs.82285 AlRS > forntyLtransf,GARS ? SS,GARS r AIRS,fDr phosphoribosytglydnamtde fomiyttransfer 4.6 

5 446089 A186C021 HsJZ70851 pftnase ESTs, &£od8ralBly siirfer to A47582 B-cel 4.6 

42442D BE614743 Hs.146688 SS.THMAPEG, pfostagEamfinEsynfliase 43 

4521 SO H28735 Hs31668 SS,WHSH2^urW2^^ Homo sapiens done PP1498 unhnossi mRNA 45 

419986 AI345455 Hs.78915 pttfnass.CPR, GMntdtog proteto Iranscripfion factor, 43 

421582 AJ910275 Hs.14CS S8^e&^mtCre(^O.SRCR t byps trefo3 factor 1(pS2) 45 

10 446733 AA863m Hs^6040 SS.TM.p450, EST* WeaHys&n&r to fatty acid omega 4.4 

4530S0 AW294092 Hs£1594 S8^as l Y_phosp!Qtose T ras hypo$hs£caJ protein M8C15754 4.4 

400205 MA N^G06265 ft *toroo sapfens RAD21 (S. pomte) 44 

42)854 AW296927 SS,WeptrdaseJA1, gM^BWO*|cW-04JU1 NCI_CGAP_Su 43 

4326S0 AF1814S0 HSw278S27 SSj^jBte t SS^fiJcsomaLU9 prenylc y stelre lyase 43 

15 441560 F13386 Hs.7888 pStase, Homo sapfens done 23736 mRNA sequence 43 

416445 AL043C04 Hs.79337 SSjrfdnase, WAA0135 pjotein 43 

439024 RS869S Hs355S3 SSJWrypsIn,v^lg ESTs 43 

432882 WL013257Hs279698 pJdnass^&ias8_C, serun^tocojrfccid regulated kinase^ 4J2 

447754 AW073310 Hs.163533 pitfnase, Homo sapiens cDNA FU14142 fe, clone MA 4.2 

20 453775 NNL002916Hs35120 SS.AAA^GJ>»4Jdras9j>l3Ka.Pl^iba\PI3 rep&attonfectorC (activator 1)4 (37 4.2 

431G57 AI345227 Hs.105448 SSJHp&rass ESTs, WeaJdy smnferto B34087 hypofftefl 4.1 

427859 AA832B8 Hs332053 SS.SAAjn^etnsABCjmenibran^ 4.1 

42338 NH.C01809Hs.1594 SS,7Hftfckse. cerrfnmisfe protein A (17fcD) 4.1 

418476 U&945 Hs.1174 anXanfr cycEn-dep8itdenttdr^tnh1b^2A(iTi8 4.1 

25 4CS922 S70284 SSJWtesaturase^S gbstearoyWCcA dasattnase {human, adlpo 4.1 

439285 AL133916 Hs.172572 SS&j&toaseJJ^JJWCT, hypo9tafical proteb FU20093 4.1 

429922 Z97630 Ha226117 SSJHEntoJilstor^TtrnJ HI Hstonefan% member 0 4.1 

420139 NfcL005357Hs35351 SS.7M.p450, Rpase, homorta^ensffitfa 43 

425071 MML013989HS.154424 SS,T4_daia&fflse,T4_dmfirase, deMnase, todoffiyrwte. type II 4.0 

30 424511 BE30Q512 Hs.193557 SS.Yj»hospha1asB ( BfflttL41 ESTs. Afooerateiy slmBar to ALU7 JiUMAN A 4.0 

428722 U76456 Hs.190787 SS.TOP, flssue Enhfotorof nwlaltoprotetoase 4 33 

428330 L22524 HsJ2258 SS^epfidaseJtf10,SSPepthiase JU10,rtfm>pematjfe rnetaEoproiefoase 7 (WBOT; uterin 33 

414831 M31158 Hs.77439 SS.C&MP J*K&^Ua>GVlGjMK protefn kinase. cAMP-dependent, regubto 3.9 

413589 AW452631 Hs313803 SS^bftytfrotee ESTs, HigWy simSar to AF157B33 1 none* 3.8 

35 41863 BE2475S0 Hs36859 SS,»eWSSJI\4PH#Vun*ea^^ 33 

413453 AA13S40 Hs.128085 SS.Pep&bsB_.C1.gpdh ESTs 33 

403943 p450^p450 CKJC0355^i|45032»|ref}HP^0C0765.1| cyt 33 

444618 AV653785 Hs.173334 ELL-RELATED RNA POLYMERASE tl. ELONGATIO 33 

408761 AA0572S4 Ha23B936 SSJM,7tmJ, ESTs, Weafefy sbnRar to (deffina not ava 3.8 

40 427809 M26380 Hs.160878 SS£pase^LAT,Sec7,PH, BpopiDtobt^se 3.7 

418203 X54942 Hs.83758 CKS,SS,CKS, CDC28 pictein kinase 2 3.7 

454071 AI041793 Hs,42502 TM.7tm_1, ESTs 37 

424676 Y085S5 Hs.1 51678 (^cos_transL2 f Rh^Jsc&vSS.G^ UDP-N-ecetykdprt3r^ 3.7 

457465 AW301344 Hs.12308 SSPnbCKytovSufetee DMA recreate factor 37 

45 417KM NM_014735Hs32292 PHD.p^nase.SS WAA0215 germ product 3.7 

446619 AU076643 Hs313 SSJ^e&iandMJrans seoBSedptosptopro^ 3.7 

4C6S25 Y13647 Hs.119597 SSJHDasalttiase^S steaiDyW^desatuiase((te^d8satUT 35 

428769 AW207175 Hs.106771 SS,7bTt.1^RY, ESTs 33 

426310 NMJCOS09HS.1692S6 S$JM,7tmJ. nempspSdeYnjcoptorYl 3.6 

50 417531 NWLC03157HS.1087 SS^nass,v^w^GVco^tratst8 sertneymreonbtet^tas82 35 

444781 t^L014400Hs.119S0 SSPH^tctemase^B GR-anchored me^^ 3.6 

431493 AI791493 Ks.129873 Ssip450^»49) ESTs,Wea^sbiiartoA36036cytocWn 3.6 

428966 AF059214 Hs.194687 chotestefol 2Shytaqtesa 35 

414175 AI308S76 Hs.103849 TM^entopsxinPep&lase^ prote5i DKFZp761D112 35 

55 4553S AW895719 TiyUonJbansJUatra, gb^44>!NC039-S03CO-154-fl» ^!^^C039 Homo 3.6 

429597 N%LP03816H&2442 acfisIntBgrlnaiidmaMJoprDteiiQsec^ 35 

425320 03344 Ks33190 A^trar^^oTLZfricMKcy^ faflyecsdsyrtea 35 

431854 AA383550 Hsl271699 IMS.SS potyni83as9(DNA tected) toia 35 

407104 S573S H&323910 SS,mSH2PH4Jldna^ecepJ n jta^^ 35 

60 44S051 AW5S14C0 Hs.333526 SS HBC receptor tyn^ltaselt^^rb^. 35 

434398 AA121C38 Hs3838 pkbas8/>0L0Jjas^»^^ s&nQr^uctohtoase 35 

454042 H22570 Hs.172572 SS^Wnase^RRMTJJ^RCT, hypo3ie£cslpiotebFU20f»3 35 

407756 AA116Q21 Ks382S> SS,UCH-1,UCH^SS,MG^bUr»ispept ubajuainspecgfc protease 18 35 

401464 AF039241 Hs3028 Pep£dasejti24, Mstonadeacs^fasaS 35 

65 41370 AB02843S Hs.177534 DSPc^yostrtM duaJsp8cEc% phosphatase 10 35 

412049 N53437 Hs.18268 SS^der^tateEanase, adenyfatohtnase5 35 

425776 U25128 Hs.1594^ SS,TM,7tn\^SS,TH7tnU para%rt»a homtone receptor 2 3.4 
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407846 AA4262Q2 Hs.40403 
406925 L34041 H&9739 
445873 AA250970 K&251946 
418054 NM.0Q2318HS.83354 
5 406815 AA833930 Hs28S036 
410530 M25809 Hs.64173 
407021 U52077 
421168 AF182277 Hs330780 
431473 AA825686 Hs321176 

10 408101 AW968504 Hs.123073 
422083 NMJ01141HS.111256 
411393 AW797437 Hs.69771 
435767 H73505 Hs.117874 
433068 NM.006456H9288215 

IS 428328 AFO37062 Hs.172914 
414575 H11257 Ha22968 
445941 A12S7371 Hs.172636 
444542 A1161293 Hs280380 
425741 AF052152 Hs.159412 

20 434228 Z42047 H&283978 
433264 D85782 H&3229 
400419 AFD84545 
439750 AL359053 H&57664 
417757 R19897 Hs.106604 

25 452194 A1694413 Hs332649 
421456 NM.0Q3G54H9.1045T6 
443767 BE562136 Hfc9736 
422648 D86983 H&118893 
423431 AA32G0&2 

30 451264 AI768235 

452110 T47667 Hs28005 
439963 AW247529 Hs.6793 
453941 U39817 H&36820 
406664 L34041 Hsl9739 

35 phosphate dehydrogenase 1 (so 
453487 R31770 H&23540 
420911 U77413 Hs.100293 
443171 BE281128 H&9030 
452256 AKD00933 Hsl28661 

40 432201 AJ538613 Hs298241 
419150 T29618 H&89640 
444443 A1149286 Hs55099 
426283 NMJXK937H3.169139 
436291 BE568452 Hs5101 

45 450223 AA418204 H&241493 
424289 AW137G91 Hs.199754 
448105 AW591433 Hsl298241 
452560 BB077084 H&336432 



THABCjrentone J ABCJiaaRIbo^ transadivator, wt 

SSJWtatsp&UnA^ dehydrogenase 1 (so 



SS^mwPABP > pHnase,1 4-3-3,irm 
SS,TMjrifto_canrJ.ysyLoxte8 
SSJPPT, 

ATP-synLab,SS.7tmJ ATP-synLab 
SS.p450.SS 

ss 



RpocygenasepLAT, 
SS r a^tiypsh f vwa^Tn\ffl)itrK)geruC l 
SSpep8dase.S8P > Pep&dase_S8 t P 
SS.PribosySran, 

SS^ctLshortTGF-oetaJGFb jxcpepfide 

SSjAhasafa. 

SS.SSJipoxygenase^lAT 

pHnase, 



pdy(A)-blnc5ng protein, cytoplasmic 1-1 
rysyfoxttase-fte2 
tRNA tsopenferiytpyTDpho3phaiebansfef33 
ATPase, H transpo^ng, lysosomal (vacuo 
gtxHuman mariner! transposasegene»oornp 
cyhxhm»P450,subfamiyllB(phenobar 
ESTs, WaaMy skraJartD S65824 reverse t 
COC2-refated protein kinase 7 
aracrtHonafe 15-flpoxygenase, second typ 
B-Cactor, propeidlh 
ESTs 



SS I PepMaseJ41 l 
7M^nt6g/iftjB^liciiuB_ledin/r7D 
deathy^LB^pUuise^cfivfrurecp, 
SS,TIA7liTiJ3^F.feceptor t susM 
SS 

SSPCUlasGa : Mnione„recy«M, 
peroMdase.LRRCT, 
SS,p450,p450 
SS>TrehaJas8 

SS^THAcfiranijecp.pUriase 
THp450,Ets 

SSJDEADWDC^eGcase_C, 
SS.TMJranspotUxotSWIB.Rr^ 
3.1 



refind ctehydrogenase 5 (11-ds and M 
Homo sapiens done IMAG&451939, mRNA se 
ESTs 

anti h op cpSdase 

Homo sapiens done 24628 mRNA sequence 
Homo sapiens PR02751 mRNA, complete cds 
cysteine dloxygenase, type I 
Target 

Homo sapfens mRNA fuD tengtti tosert cDN 
ESTs 

olfactory receptor, family 2, subfamffy 
caiboliyibule (keratin sulfate Ga(-6) sii 
proteasomo (prosome, macropaJh) 26S subu 
Melanoma associated gene 
gb£ST29171 Cerebellum D Homo sapiens c 
gbwD82g08jrt ScaresJ*SFJ*^JJrj>AJLS 
Heme sapiens cmRJ11309^ctone PL 



Bloom syndrome 



SSJM,7tm_1,nmSS 
TM,GO» f 7tnru1. 
SS.TKtrypsh^,m^ 
TM,pMna3e,m3, 



SS^bhydrotase, 
SS,pfD_feofnejase, 
SS,TM,7lmJ^GPS 
SS,TM,trefo^trypsin4n3foIl 
SS,mnj*Raj^,pWnase t C2,p4to ESTs 



ESTs 

ainfcedK^cety^kJcceanmie (G)cNAc)tr 
TONDU 

Homo sapiens cDNA RJ10071 fis, done HE 
Tfansmembfane protease, serine 3 
TEK tyrosine kinase, endothelial (venous 
rab6 GfTPase acftraflng protein (GAP and 
kynurertirBse(L-4cynuierilne hydrolase) . 
protein regulator of cytokinesis 1 
natural kfflef^umoriecognfl^) sequenc 
ESTs 
Tra 



3.4 
3.4 
3.4 
.3.4 
34 
3.4 
3.4 
3.4 
3.4 
14 
33 
3.3 
33 
33 
33 
33 
33 
33 
33 
33 
33 
33 
33 
33 
12 
32 
32 
32 
32 
32 
32 
32 
3.1 

gfycero£3- 

3.1 
3,1 
3.1 
3.1 
3.1 
3.1 

3.1 

3.1 
3.1 
31 
3.1 

ao 

3.0 
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TABLE 21 A 

Table 21 A shows the accession numbers for those pkeys lacking unigenelD's for Table 21. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Ptey: Unique Eos probeset identifier number 

CAT number Gene duster number 
Accession: Genbank accession numbers 

Pfcey CAT number Accessions 



420854 197072.1 AW296927 A1684514 A1263168 AA281079 

20 423431 2281GL1 AA326062AA325758AW962182 

423945 233566J AM10943 AW948953 AA334202 AA332882 

451264 863988.1 AI768235 R31400 H29082 H23107 

455325 1279475J AW895719 M31451 N41451 

456207 165078 -1 AA193450 
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TABLE 21B 



Table 21B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 21. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 



10 



15 



20 



Ptey: 
Ret 

Strand 



Pkey Ref 



Unique number cofrespondktg to an Eos probeset 

Sequence sourca Th8 7aTgammiberaintrdscohin™are Dunham L etal" refers to the publication 

enffitedTheDNA sequenced human chromosome 22." Dunham L et aL, Nature (1999) 402:489495. 
trKlkztasr^tAsfe^fomwhtaex^ 
Indtcstes nucleotide posHwns of predicted exons. 



Strand Nt_posraon 



401045 8117619 Pius 

402230 9966312 Mnus 

402408 9796239 Mnus 

402578 9884928 Phis 

403593 6862650 Minus 

403943 7711864 Phis 

404091 7684554 Minus 



90044-90184,91111-91345 

29782-23932 

110326-110491 

6635066496 

62554&712&44&69602 

100742-100904,101322-101503 

82121-83229 
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TABLE 22: 739 GENES UP-REGULATED ESf BREAST CANCER 
COMPARED TO NORMAL ADULT BREAST 

Table 22 shows 739 genes up-regulated in breast cancer compared to normal adult breast. 
These were selected as for Table 19, except that the ratio was greater than or equal to 3.0, the 
denominator was the 85* percentile value for 12 non-malignant breast specimens, and the 
96 th percentile value amongst the 73 breast cancers was greater than or equal 100 units. 



Pkey: Unique Bos probeset Identifier number 

ExAocrc Exempt Accession rnnTto.Genbankaocessk^numb^ 

UnjgeneB): Unjgene number 

IS UnJgeneTffle: Ungate gene ffib 

R1: Ratio of 90* percentile tumor to 85* peroentfle normal breast Sssue 



Pkey ExAccn UnJgene© UnigeneTWe R1 



400292 AA250737 Hs.72472 BNP-R1B 515 

424735 U31875 Hs272499 srnrt<^albQhc408hya^ 383 

400297 AI1Z7076 Hs334473 hypofiie&a) proton DKFZp56401278 293 

431448 AL137517 Hs334473 typothefca) protein DKFZp56401Z78 265 

25 451110 AI9S5040 Hs2S5398 EST* WeaWysfetotofcansfofnBfiorK 253 

431211 M86849 Hs323733 g^jioicto protein, beta 2, 2Qd) (conn 232 

418203 X54942 H&83758 CDC28|rotein kinase 2 223 

407980 AA04S309 gtezf12fD1.s1 Soajes^fetaU»anLNbHH19W 193 

414646 AA353776 Hs301 C048anBgen(^ineinbraneprotBfai) 183 

30 446921 AB012113 Hs.16530 srnalMuc^cytDki)esiib^A(Cy 183 

409041 AB033025 Hs30081 K1AA1199 protein 173 

412140 AA219691 Hs73625 RAB8 toteractfog. Wnesto-llte (rabkines 173 

407824 AA147884 Hs3812 Homo sapiens cDNA RJ14388 8s, clone HE 17.1 

453160 AB63307 Hs239884 H2Brustonefam3y,rr*miberL 173 

35 407137 T97307 gbcye53h05\s1 Soaresfetal fiver spleen 16.1 

425692 D90041 Hs.155956 N^celxttransfefase 1 (andante Nicety 16.1 

438533 AM40266 Hs.170673 E STs, WeaMy sknfer to T24832 hypofoea 163 

428227 AA321649 Hs2248 sina]trKhicB}tocytDtesub&ndyB{CX 153 

444342 NKL014398HS.108&7 sJrr^ to tysosorr^ssodatBd membra 15.1 

40 422505 AL120862 Hs.124165 r^ranYnedceDdeaff)9(PDCD9) 143 

430515 AA746503 Hs283313 ESTs 147 

417308 H60720 Hs31892 WAA0101 gene product 14.4 

45Z744 AC67652 Hs30604 mmsa^nPW\;<Ot^Om^MBS^^4A 

412446 AI768015 Hs32127 ESTs 142 

45 415539 AI733881 Hs.72472 BMP-R1B 14.1 

435496 AW840171 Hs265398 ESTs, WeaHy sfmrfer to transformaton-r 133 

438209 AL120659 Hs3111 aiyt^rocarbon recep^ 133 
400205 NA N^006265*-itorno sapiens RAD21 (& pombe)133 

430365 M489732 Hs.154918 ESTs 134 

50 415263 AA948033 Hs.130853 ESTs 133 

451952 AL120173 Hs301663 ESTs 132 

449722 BE280O74 Hs23960 cycfinBV . 132 

406685 M18728 gteHimrian norer^cfccrossreactm arrtig 133 

406690 M29540 Hs220529 cardraeint^ 123 

55 429925 NKL000786Hs226213 cytochrome P450, 51 (fanosterol 14*feha 123 

416498 U33632 Hs.79351 potassium cnaiinelsub^ 127 

432378 AW93046 Hs.146133 ESTs 123 

441377 BE218239 Hs202656 ESTs 123 
45S207 AA193450 gbczr40e07i1 SoaresJ^hHMPiLSI Homo sapi12.4 

60 422805 AA436989 Hs.121017 H^Mstmlan^.niemberA 122 

407811 AW190902 Hs.40098 cysteine tadsupoJanalyl. BMP antagon 122 

407178 AA195651 Hs.104106 ESTs 122 
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420931 AF0441S7 Hs.100431 sm^MucIUeGytoltlnsBsuMam^lCy 111 

421727 Y13153 Hs.107318 kymmfaZwonooxw 12.0 

434408 AI031771 Ha.132586 ESTs 12X) 

446591 H44186 Hs.15456 POZ (kwrah containing 1 113 

431385 BE178536 Hs.11090 membran&^pannhfl 4-domains, subfamByA 113 

443348 AW873596 Hs.182278 c^itodu&i 2 (ftephoybse Knase,delt 11.7 

416602 NM_006159Hs.79389 nei (chfcfcenHke 2 11.7 

433365 AF026944 Hs293797 ESTs 113 

437866 AA156781 Hs.74170 rreta9othfaneIn 1E (funcfional) 115 

412472 AW975398.Hs293836 ESTs 11.4 

41G030 H15261 H&21348 ESTs 113 

439979 AW600291 Hs.6823 hypofteficaJ protein FU10430 113 

420757 X78592 Hs39915 androgen recepfr{d!iydrotesta 113 

411598 BE336654 Hs70937 H3fetor»fan% t member A 112 

423600 AJ633559 Hs31D359 ESTs 112 

430770 AA765694 Hs.123296 ESTs 113 

421037 AI6B4808 Hs.197653 programmed cefl death 9 (POCD9) 103 

452461 N78223 Hs.106106 transcription facto 10J 

409269 AA57G953 Hs22972 hypofte&al protein FU13352 105 

417791 AW965339 Hs.111471 ESTs 103 

447268 AJ370413 Hs36563 hypofte fc alpMtefaFU22418 104 

424001 W67883 Hs.137476 patemaBy expressed 10 104 

447342 A1199268 Hs.19322 Homo sapiens, StaBartoRIKB*cDNA 2010 104 

424905 NiyL0Q2497Hs.1537O4 (rover In mSosis gene a^Blatedk 10.1 
453619 H87648 Hs.33922 Homo sapiens, done MG&9084, mRNA, complO.I 

442942 AW167087 Hs.131562 ESTs 10.1 

434377 AW137148 Hs306583 Homo sapiens cDNAFUl 1382 fis, done HE 10.1 

427217 AA399272 Hs.144341 ESTs 10.1 

445730 AJ624342 Hs.170042 ESTs 103 

432887 AI926047 Hs.162859 ESTs 103 

452243 AL355715 Hs28555 programmed cell deaft 9 93 

424590 AW966399 Hs.46821 hypofteficalpiotBfoFU2Q086 93 

432169 Y00971 Hs2910 phosphonbos y t pyrophosphate synthetase 93 

438950 H23789 Hs.144530 EST 93 

418836 AI655499 Hs.161712 ESTs 93 

430291 AV660345 Hs238126 CGW9 protein 93 

444665 BE613126 Hs.47783 Baggresshntyniphomagene 9.7 
407377 C16391 gb£16391 Ctarrtech human aorta poiyA mRN 9.7 

445413 AA151342 Hs.12677 CGM47 protein 9.7 

443462 AU64690 Hs.171176 ESTs 9.7 

442145 A1022650 Hs.8117 erbU4nterac&g protein ERBW 97 

435570 AF212222 Hs.177812 unchafacten>edbonen^owprdBinBM04 97 

439820 AL3602O4 HS283853 Honwsap^niRNAfultengtti Insert cDN 93 

428966 AF059214 Hs.194687 cholesterol 254njdroxy!ase 93 

449448 D60730 HS57471 ESTs 93 

433929 AI375499 H&27379 ESTs 95 

432731 R31178 Hs287820 ftronecflnl 93 

411815 AA156679 Hs.125790 leucterich repea^oontaining 2 93 

415385 R17798 Hs7535 COBWte protek) 93 

422026 U80736 Hs.110826 trtacteofide repeat cwitafrAig 9 92 

432596 AJ224741 H&278461 matin 3 92 

439451 AF086270 H&278554 hetenxIsomaMce protein 1 92 

423945 AA410943 gbezt32h03lr1 Soaresovary tumor NbHOTH 9.1 

442432 BE093589 Hs38178 hypofteficaJ protein RJ23468 9.1 
446715 AB37735 Hs.173919 ESTs, Modoatefrsimaar to 2N91 _HUMANZ9.0 

408771 AW732573 Ks.47584 potassium voftag^alBd channel, delayed 93 

437021 AI076089 Hs292239 ESTs 93 

428479 Y0Q272 Hs.184672 celdMsfcwcycte2:G1toSandG2to 83 

428839 AI767756 Hs323Q2 Homo sapiens cDNAFU14814 fis, done HI 83 

402408 NA NMJ)30920*itosaptoriypod)eS(alpro 83 

418601 AA279490 Hs36368 cafcnegn 83 
426327 W03242 Hs.44698 Homo sapiens done TCCCTA00151 mRNAsequ83 

419519 AI198719 Hs.176376 ESTs a8 

440621 AW29HE4 Hs.150434 ESTs 83 

446142 AI754693 Hs.145968 ESTs 83 

418196 AI745649 Hs26549 KIAA1708 protein 8.7 

447178 AW594641 Hs.192417 ESTs 87 
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427585 D31152 Hs.179729 coIbgeatypeX, a^1 (SchmkJmeteph 8.6 
415857 AA866115 Hs.127797 Homo sapiens cONAFUl 1381 fis, done HE 8.6 
435061 AI651474 Hs.163944 ESTs 8.6 
431374 BE258S32 Hs251871 OTP synthase 8.4 
5 417866 AW0679O3 Hs*2772 coBagen, type XI, alpha 1 8.4 
437211 AA382207 Hs3509 eco^ viral integ^ site 28 8.3 
437751 AA767373 H&356G9 ESTs, Moderately sirnflar to ALU1 JUJMANA83 
423887 ALO8Q207 Hs.134585 DKFZP434G232 protein 8.2 
440941 BE268362 Hs7535 COBWte pmteh 8.2 

10 429859 NM_007050H&225952 protein tyrosine phosphatase, receptor! 8.2 

410193 AJ132592 H&59757 2tnc finger protein 281 &2 

431725 X65724 K&2839 Nome disease (rjseudogtoma) 8.1 

446258 AI283476 Hs263478 ESTs 8.1 

416747 AW876523 Hs.15929 hypofte&al protein FU12910 8.1 

15 434424 AI811202 H&32S335 Hc^ sapiens cDKA:FU23523te, done L 8.1 

421650 AA781795 H&122587 ESTs 8.0 . 

429534 AW976967 Hs.163327 ESTs.Weatys^to2109260ABcefl 8.0 

457465 AW301344 Hs.122908 DNArepGca&n factor 8.0 

427961 AW293165 Hs.143134 ESTs 8.0 

20 436481 AA379597 Hs£199 H^150piotehsirn^toUb^ftnHX3n 8.0 

418216 AA662240 Hs283099 AF15q14proteh 8.0 

418250 U29926 Hs.83918 ao^nosine monophosphate deaminase 73 

400285 NA EosControl 73 

401464 AFQ39241 H&9028 hfek3f»o^ace^ase5 73 

25 407242 M18728 gbifamannaBpecfccrossreadlrtga^ 7 A 

472232 D43945 Hs.113274 transcription {actor EC 7Z 

454024 AA9S3527 H&293907 hypo&e^protefr»FU23403 7.8 

444542 AI161293 H&280380 anhopepOdase 7 A 

436396 AI683487 Hs.152213 wirgtess^rVl^tritegra^sitefainl 7.7 

30 437204 AL110216 Hs.12285 ESTs, WeaWy state to 155214 saltan/ 7.6 

408805 K69912 Hs.48269 vaccinia related kinase 1 7.6 

437207 T27503 Hs.15929 hypofteScalpfo1einaJ12910 7.6 

442818 AKD01741 Hs3739 riypc^hetkaiprotenRJ10879 7.6 

426283 NftLQ03937Hs.169139 kyrajrenhas*(L-kyrajr^^ 7.5 

35 424687 J0507O Hs.151738 [ratnxrnetaBo(notei^9(a^finaseB 7.5 

446315 NM.0162S3HS.14770 r^fog integrator 2 7JS 

433426 H69125 Hs.133525 ESTs 7J5 

406639 M97711 gbtfuman T-ceB receptor (V beta 18.1, J 7.5 

420077 AW512260 Hs37767 ESTs 7.4 

40 457332 AA961694 Hs.105187 ktnestnprotel)9gene 7.4 

422938 NM.001809Hs.1594 cefrtromere protein A (17kD) 7.4 
447555 A1391662 Hs.160963 Homo sapiens, done MfX^2318, rnRitt, com7.4 
444618 AVG53785 Hs.173334 EU.-RELATED RNA POLYMERASE It, ELONGATO7 J 

410361 BE391804 Hs£2661 p^anytateb^prote^t^nen^nvv 73 

45 400268 NA rM.003292i^sa^rjanstocatedprorn7^ 

439509 AF086332 Hs5B314 ESTs 73 

407771 AL138272 Hs£2713 ESTs 73 

407202 N58172 Hs.109370 ESTs 73 

433096 AU0768Q3 Hs282975 caftxaytesterase2(htesCne, Over) 72 

50 422094 AF129535 Hs272027 F-baxonJypro^n5 ' 7.1 

430832 A1073913 Hs.100686 ESTs, WeaWy sNtarto JE0350 Anterior 7.1 
430287 AW182459 Ks.125759 ESTs, WeaWy sfrnOar to LHJ5 JflJMW IHJKE7.0 

423739 AA398155 HsS7600 ESTs 73 
448212 AM75858 gfcto37d07jr1 HCLCGAPJXL1 Homosapiens7.0 

55 407277 AW170035 Hs326736 Honx>sap^ breast cancer an^nrW-BR 73 

454440 BE062906 Hs28338 WAA1546 protein 7.0 

444783 AKD01468 Hs.62180 anQfo(Drosq)hia Soaps homotogX 7.0 

421373 AA808229 Hs.167771 ESTs S3 

431960 AW241821 H&3W9Z7 c&IA 6,9 

60 424704 AC63293 Hs.152096 cytoc*roraP450,sub^tU(^^ 64 

449517 AW500106 Hs23643 serirrertireorrfreprote^ 6\8 

439840 AW449211 Hs.105445 GD^fan^ receptor alpha 1 63 

414080 AA135257 Ks.47783 Baggrassheryniptanagene 63 

441243 AI767056 Hs.193002 ESTs 6.7 

65 408380 AF123050 Hs.44532 cfiuWquain 67 

422956 BE545072 Hs.122579 rrypctoefcal protein FU10461 6.7 

446651 AA393907 H&97179 ESTs 6.7 
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419839 U24577 Hs.93304 phosphoHpase A2, group VII (platelete 6.7 

437740 AA810265 Hs.122915 ESTs 67 

421582 AI910275 Hs.1406 trefo* factor 1 (pS2) 6.7 

427356 AW023482 Hs>97849 ESTs 6 J 

5 429597 NMJ0O3816H&2442 adfeWegnnarrfmeialkxTO^ 63 

422634 NM.016010HS.118821 CG« protein 6.6 

421072 AI215069 Hs.89113 ESTs 6.5 

427718 M798680 Ks 75933 ESTs 6.5 

411000 N40449 H&201619 ESTs, WeaWy similar to S36383^B4B pro 63 

10 449343 AI151418 H&272458 pratesi phosphatase 3 (fanner* 2B), cat 6.4 

400757 NM_001898Hs.123114 cystafinSN 6.4 

447164 AF026941 Hs.17518 Homo sapiens cfe5 mRNA, partial sequence 6.4 

456938 X52509 Hs.161640 tyrosone ainhotransferase 6.4 

418648 A1820961 Hs.193465 ESTs 6.4 

15 424902 MUL0Q3866HS.153687 toositolptrfyjtosphato^ 6.4 

'452838 U65011 Hs*30743 preferenfialy exceed anflgen h mela 6.4 

439452 AA918317 Hs^7987 B^OlJ?ymphoiTia11B (zinc ^ pro 6.4 

407266 AJ235664 gbi^saj)leiBrnRNAfiv^nc{|k)b^ 6.3 

411078 AI222020 Hs.182364 CccoaCnsp 6.3 

20 433001 AF217513 Hs3799Q5 done HQD310 PRO0310p1 6.3 
434340 AI193043 Hs.128685 ESTs, Weakly simlar to T1 7226 hypofoefl , 6J2 

429503 AA394183 H&26873 ESTs S2 

402578 C10OH34^2117372h*|p65981 tatty ac 6-2 

409646 AW161391 Hs709 deoxycyfitfineUnase 6.1 

25 430447 W17064 Hs332840 SWl^F related, natraassodafed,acfi 6.1 

432415 T16971 Hs389014 ESTs. WeaHysfcnfar to A43932 mucin 2p 6.1 

443709 AI0a2692 Hs.134662 ESTs 6.1 

420929 AI694143 HsJ296251 programmed ceB deaft 4 6.1 

428368 BE440042 Hs33326 rnairixnteIaI!oproteir^ ' 6.1 

30 428248 AI126772 Hs.40479 ESTs 6.0 

420344 BE463721 Hs37101 putafive G protebcoupled receptor 63 

453382 U23752 Hs32964 SRY(sexdetemiir^regkinY)-box11 63 

425397 J04088 Hs.156346 topoteomerase (DMA) D alpha (170kD) 64) 

418007 M13509 Hs33169 inattcinetaBopirtehasel (MMP1; inters 63 

35 428565 AB007863 Hs.185140 WAA0403 protein 6.0 

437608 AA761605 HsJ292308 ESTs, Weakly simBar to ALU1_HUMAN ALU S63 

427408 AA583206 H&2156 RAR-ietated orphan nxeptor A 6j0 

406687 M31126 Hfc272620 matt* melafioprotehase 11 <MMP11;stro 63 

418092 R45154 Hs.106604 ESTs 6.0 

40 447051 AW139130 Hs.160951 ESTs, Weakly simBar to Con1 [H.saptens] 6.0 

441233 AAS72965 Hs.135568 ESTs 63 

432239 X81334 H&2936 matrte meiaDofwote^ase 13 (coflagenase &0 

435106 AA100847 Hs.193380 ESTs, Highly simBar to AF174600 1 F-box 53 

435525 A1831297 Hs.123310 ESTs 5.9 

45 458809 AW972512 H&20985 sm3essociated pojypepfide, 30kD 53 
410785 AW803341 gMl2-*JM007&090300^^^ 

422576 BE548555 Hs.118554 CGL83 protein 53 

451398 AI793124 Hs.144479 ESTs 55 

441881 AW968904 Hs.179566 hypotiefical protein FU22624 53 

50 412022 AI005043 rfe24143 WistottAWitt) syndr^ 53 

416636 N32536 Hs.42645 sofeitecanierfainily 16(niorK)car^ 54 

447350 AJ375572 Hs.172634 ESTs 53 

434094 AA305599 rte2382D5 hypome^i>roteinPRO2013 54 

409151 AA306105 H&50785 SE(^V85ktotraffiddrtgpiotBm(S.c 53 

55 448807 AI571940 Ks.7549 ESTs 53 

452281 T93500 H&28792 Homo sapiens cDNAFU11041 8s,donePL 5.8 

421281 AC99139 Ha.17517 ESTs 53 

430361 AI033965 H&239926 sleroM>MTJemy»oxktase4te 53 

400289 XD7820 Hs2258 matte inetallopfDtehase 10 (MMP10;str 5.7 

60 440527 AV657117 Hs.184164 ESTs. Mort^shto to S65657 alpha 5.7 

434674 AA831679 Hs.136985 ESTs 5.7 

'426320 W47595 Hs.168300 trareformlriggra^ factor, beta 2 5.7 

452401 NMJ»7115H3^9352 tumor necrosis factor, alpha^dUcedpro 5.7 

448663 BE614599 Hs.105823 hypc4te8calprotelFiMGC147S7 5.7 

65 438199 AW016531 Hs.122147 ESTs 5.7 

446203 Z47553 Hs.14286 flavin contain^ monoowygenase 5 5.7 

428336 AA503115 Hs.183752 microserrihcproteh, beta- 5.6 
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430379 AF134149 Hs.24G395 potassium ctennelsubfenrilyKmBntoe 5.6 
422835 BE218705 Hs.121378 metaftMneliv^IkB 5, tes8&«pedDc 5.6 
444758 AL044878 Hs.11899 SJiyAoxy^ra^^ 5.6 
443426 AFD9815S Hs3329 chromosome 20 open reading frame 1 5.6 
5 400301 XD3635 Hs.1657 estrogen receptor 1 5.6 
447078 AW885727 Hjl301570 ESTs 5.6 
432015 AL157504 Hs.159115 Homo sapiens rnr^cDNADKF7p5KO0724{B3 
438691 AA906288 Hs£12184 ESTs 5.5 
439809 R41398 Hs.101774 hypomeOcal protein FU23045 53 

10 415786 AW4191S6 H&257924 hypcftebca) proteh RJ13782 53 
456373 BE247706 Hs.89751 membrane-sparring 4-domafo%st&femtyA 53 
401645 NA C16001440^gi)123aJ704|gb|AA652fi90.HAF355 
437967 BE277414 Hs3947 mdtrarsformlngcfKXjgene^^^ 53 
445885 AI734008 Hs.127699 WAA1603 protein 5.4 

15 439138 AI742605 Hs.193696 ESTs 54 
440270 NMJM5986HS.7120 cytokbie receptor^ motecule 9 54 
437536 X91221 Hs.144465 ESTs 5.4 
438167 R28363 HsJ24286 ESTs 5.4 
452741 BE392914 H&30503 Hoir© sapiens cDNAFTJ1 1344 done PL 5.4 

20 426214 H59846 Hs.128355 ESTs, Moderately amifer to A!^^ 

413554 AA319146 Hs.75426 seoetogranh II (cfcorrwgiamnC) 5.4 
422857 L32137 Hs.1584 carBage oGgomerlc matrix protein(COM 5.4 
434263 N34895 Hs.44648 ESTs 5.4 
446382 AW205168 Hs.150823 ESTs 5.4 

25 422406 APD25441 Hs.1 16206 Opa-trrteracahg proteh 5 53 
438321 AA576635 Hs.6153 CGUSprotetn 53 
418310 AA814100 Hs36693 ESTs 53 
419625 U91616 H&91640 nuclear factor of kappa DghtpotypepSd 53 
450701 H39960 Hs288467 Homo sapiens cDNAFU12280fis.dbne MA 53 

30 445900 AF070526 Hs.13429 Homo sapiens dons 24787 mRNA sequence 52 
449051 AW961400 H&333526 KBUrecep&rtyrDskiekir^fc^rb^ 52 
418758 AW959311 Hs.172012 rrypoft8^r^roteinDKFZp434J^ 52 
431070 AW408164 H&249184 transcn^0onfactor19(SC1) 52 
417079 U65590 Hs*1134 irrierleukhl receptor antagonist 5.2 

35 421328 AF013758 Hs.109643 poiyadenyiaie binding prtehtateracfin 5.2 
428804 AKD00713 Hs.193736 hypo9ieo^pmtehRJ^0706 5.2 
427427 ARJ77345 Hs.177936 ESTs 52 
403485 C3001813*5i|12737279lrelp(P - 0121B3.1| k 5.2 

422168 AA586894 Hs.1 12408 S100 calcfum-bimiing proteh A7 (psorfas 5.1 

40 421937 AI878&57 Hs.109706 ternatoloojcaJarrire^^ 5.1 
426752 X69490 Hs.172004 Bm 5.1 
453310 X70697 Hs353 soluiBcanler family 6 (neurotransn^te 5.1 
423198 M81933 Hs.1634 oef cMdbn cyete 25A 5.1 
412281 AIB10054 Hs.14119 ESTs 5.1 

45 447513 AW955776 Hs313500 EST^Moc«rater/slmBartoALU7jnJI^ 

453931 AL121278 H&25144 ESTs 5.1 
404347 TajgetExon 5.1 

431808 M30703 H&270833 arnphfceguSn (schwarraTta^errvedor^ 5.1 
429113 D28235 Hs.196384 pcostagtetafrendoperox^ 5.1 

50 436291 BE568452 Hs3101 proteh regulator ofcytoKnesfel 5.1 
450603 R43646 Hs.12422 ESTs &1 
434725 AKD00796 Hs.4104 rypofteik^ protein 5l0 
435981 H74319 Hs.188620 ESTs 5.0 
407376 AA993138 Hs.142287 ESTs, Weakly 3im8ar to AUJF_HUMAND)! 5D 

55 431689 AA305688 H&267695 UlWSafctetaGlcNAcbete^^^ &0 
405348 NA C7W)1664^12698061|dbpAB21B49.1|(AB 5J) 

436196 AKD01084 Hs333498 Honto sapiens cDNARJ10222ffe, done ^ SjO 
437065 AL036450 Hs.103238 ESTs 5i0 
410196 AIS36442 Hs39838 hypctofcal protein FU108TO 5\0 

60 429412 NM.006235HS2407 POUctormun, class 2, assorting fector 5j0 
446619 AU076643 Hs313 secreted ptosritoprotBh 1 (os^ 4.9 

403329 NA Target Exon 43 

442875 BE623003 H&23625 Horr»sapterodoreT(XXn'AfJ0142fnRNAse^ 
442441 AI820662 Hs.129598 ESTs 4^ 

65 430375 AW371048 Ks£3758 WMstorrabmOy, member H 43 
424128 AW966163 gbcEST378236 MAGE resequences, MAGI Homo4J9 

408873 ALO46017 Hs.182278 caferioo^2(r>ho3prK>^^ A3 
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407910 AA650274 Hs.41296 Rbronecttn leucine rich transrnembranep AS 

432606 NNL002104Hs3066 giarizymeK (serine protease, granzyme 3; 4,9 

453204 R10799 Hs.191990 ESTs 4.8 

452020 AA722012 Hs.256757 ESTs, Weakly similar to AT2/LHUMAN POTEN4.8 

5 449048 245051 Hs22920 sirnBarto S68401 (cattle) glucose Indue 4.8 

408369 R38438 Hs.182575 scto earner farr^l 5 (HWtransp^ 4.8 

431645 AF078849 Hs^66483 dyneli light chairvA 4.8 

423575 C18863 Hs.163443 Har»sapterttcDr^aJ11576fis,doneHE 4.8 

444246 K93281 Hs.10710 hypofte&al protein FU20417 4.8 

10 421524 AA312082. Hs.105445 GDWfamiry receptor alpha 1 4.8 

452827 AI571835 H&55468 ESTs 4.8 

414222 AL135173 H&B78 sorb^ dehydrogenase 4.8 

456086 AL161999 Hs.77324 eukaryofctraretaBon termination tacto 4.8 

419078 M93119 Ha.89584 Irauftiofra-assodated 1 4.8 

15 418973 AA233056 Hs.191518 ESTs 4.8 

447033 AB57412 Hs.157601 ESTs 4.8 

451621 AI879148 Hs26770 fettyacW binding protein 7, brain 4.7 

419968 X04430 Ha33913 hterfeuWn 6 (toterfer^ beta 2) 4.7 

424326 NML014479H3.145296 dfcWegrtn protease 4.7 

20 431565 BE242803 rfeL262823 hypofiieficaJ protein RJ10326 4.7 

429294 AA095971 Hs.198793 Homo sapiens cDNA:FLI22463fis, done H 47 

416814 AW192307 Hs.80042 dofcrry^-Gk^ar^lcKAc2^ 4.7 

439897 NftLW531Wfe6763 KIAAD942 protein 4.7 

429687 AI675749 H&211608 nucteoportn 153W) 4.7 

25 422880 AF228704 Hs.121524 glutathione reductase 4.7 
405801 MU»G390-itarno sapfens cfKuwdererraa (Ra4.6 

432435 BE218886 H&28207O ESTs 4.6 

439544 W26354 >te^8891 bn)o8)e^prc^aJ11360;arterrdsp 4.6 

425354 1^2027 Hs.155935 complement cornponent 3a receptor 1 4.6 

30 436027 AI864G53 Hs39972 ESTs, WeaMy similar to 138588 reverse t 4.6 

424623 AW963062 Ha337404 ESTs 4.6 

403366 NA Target Exon 4.6 

402542 Target Exon AJo 

450193 AI916071 Hs.15607 Homo sapiens Fanconl anemia cornp!ernentat4.6 

35 411678 A1907114 HsJ1465 squatene epoxEdase 4.6 

456844 AI264155 Hs.152981 CDP-^Bacy^cerol syrrmase (phosphafida AB 

448072 AW59B05 H&24908 ESTs 4.5 

408045 AW138959 H&245123 ESTs 45 

423782 AI472209 Hs323117 ESTs A3 

40 447388 AW630534 Hs.76277 Homo sapiens, clone M6CS381. mRNA, cornp4-5 

448140 AF146761 rb20450 BC&Mte membrane proteto precursor 45 

452561 AI892181 Hs.49169 WAA1634 proten AS 

425331 AW962128 gb£ST374201 MAGE resequences, MAGG Homo4.5 

428801 AW277121 Hsl254881 ESTs AJS 

45 428500 AK15395 Hs.184641 fatty acid desaturase 2 A3 

426075 AW513691 H&270149 ESTs, WeaMy similar to 21 09260ABceO 4.4 

437259 AJ377755 Hs.120695 ESTs 4.4 

400409 AF153341 H&283954 Honasapters'wfcgedhe^^ 4.4 

412863 AA121673 H&59757 zinc linger protein 281 4.4 

50 426989 A1815206 Hs£9395 ESTs 4.4 

401866 Target Exon 4.4 

418819 AA228776 Hs.191721 ESTs 4.4 

406348 Target Exon 4.4 

412138 AW895387 gb*W4^NfJQ3840030<M57-c10 NN0038 Homo4.4 

55 428550 AW297880 Ha98661 ESTs 4.4 

411743 AW8G2214 gb^4-CT0361^1299W-^CTD361 Homo4.4 

429966 BE081342 H&2B3037 HSPC039 protein 4.4 

423291 NHJX)4129Hs.126590 guanytato cyclase 1, soluble, beta 2 4.4 

423456 AL1 10151 Hs.128797 DKFZP586D0824 protein 4.4 

60 452190 H26735 H&91668 Homo sapiens clone PP1498 unknown mRNA4.4 

424871 NMJD04525HS.153595 lowA^Dpoprofear^^ 43 

429575 AA706003 Hs49387 ESTs A3 

429922 Z97630 H&226117 HI hfctone family, member 0 43 

421379 Y15221 Hs.103982 srnaDino^idbtecytokire 43 

65 400300 X03363 r©*2 receptor tyrosine Wnase(oerW32, 43 

437258 ALD41243 Hs.174104 ESTs 43 

446595 T57448 Hs.15467 hyrJC*etk^rxoteInHJ20725 43 
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403011 NA BmX)(»0215330^rotebl8sen^^ 43 

419055 AI385384 Hs.11571 Homo sapiens cDNARJI 1570 ketone HE 43 



418661 NKL001949HS.1189 E2FtnmscnpBon factor 3 43 

407786 AA687538 Hs38972 tetraspanl 43 

5 429183 AB014604 Hs.197955 WAA0704 proteh 43 

442914 AW188551 Hs39519 hypoiheScalpfotoFU14007 4.3 

441029 AJ0917S5 Hs.179246 ESTs 43 

452194 AB94413 Hs332649 ofoctwy receptor, family X subtarnSy 43 

414821 M63835 HsJ7424 Fc fragment of lgG,Wghafflr«y la, re M 

10 410102 AW248508 H&279727 Homo sapiens cONAFU14035fis, done HE 42 
452110 T47667 Ha28Q05 Homo sapiens cDNAFUl 1309 fis, done PL 4.2 

442007 AA301116 Hs.142838 nuc^brpho3phopiD^Nopp34 A2 

417318 AW953937 Hs.12891 ESTs A2 

431818 AW510444 Hs.191705 ESTs, Weakly sirrta to T471 84 hypofoett A2 

15 443646 AJ085196 Hs.164226 ESTs A2 

419169 AW851980 Hjl262346 ESTs, Weak* simaar to S72482hypofrt88 A2 

446839 BBQ91926 Hs.16244 mRoficspfinolecoBeo^coBieiatedpiDt A2 

42326 AL0394G2 Hs.125783 DEME-6 protein A2 

432116 AA9Q2953 Hs308538 ESTs 42 

20 409038 T97490 Hs30002 smaIhAiclbtecytoHr» 42 

445625 BE246743 H&288529 hypodefoal piotebi HJ22635 A2 

425139 AW63048B Ks325820 protease, serins, 23 42 

447397 BE247676 Hs.18442 E-1 enzyme 42 

410166 AK001276 Hs>59346 hypothetical protein RJ10514 4.1 

25 437295 AW779318 Hs38417 ESTs 4.1 

430486 BE082109 H&241551 chloride channel, calcium ac8vated,fam 4.1 

441790 AW294909 Hs.132208 ESTs 4.1 

410129 BE244074 Hs38831 regu^dFas^no\iceda|»^osis 4.1 

427521 AW973352 H&2SQS5 ESTs 4.1 

30 425247 NMJQ5940HS.155324 matrametalkjproteirtase11 (MMP11;stro 4.1 

412886 AP041163 Hs.74647 Hivt^T^ receptor a(^a^cham 4.1 
441153 BB6282B gfc601336534F1 N1H.MGC.44 Homo sapiens c4.1 

444301 AKD0O136 Hs.10760 asponn (LRR dass 1) 4.1 

426711 AA383471 Hs.180669 conserved gene ampked to osteosarcoma 4.1 

35 405850 m Target Exon 4.1 

440283 AI732892 Hs.190489 ESTs 4.0 

432441 AW292425 Hs.163484 ESTs 4.0 

400284 NA estrogen receptor 1 4.0 

417341 N91453 Hs.102987 ESTs 4.0 

40 429732 U20158 Hsw2488 lymprwcyte cytosolic protein 2 (SH2doma 4.0 

411383 AW797437 Hs*69771 Wactor, properdin 4.0 

425704 U79293 Hs.159264 Human done 23948 mRNA sequence 4.0 

419594 AA013051 Hs31417 topofeomerase(DMA) U bkxftig protein 4.0 

419092 J05581 H&89603 much 1, transmenArane 4.0 

45 443147 AKB4351 Hs.19030 ESTs 4.0 

406633 AW963372 Hs.46677 PRQ2000 prctem 4.0 

433404 T32982 Ks.102720 ESTs 4.0 

421506 BE302796 Hs.105097 fhymbftia kinase 1, soluble AS) 

417900 BE250127 Hs32906 CDC20 (eel (flvtsion cycle 20,Swcerevi 33 

50 414602 AW630088 Hs.76550 Homo sapiens mf^cDNADKFZp564B1264(e33 

413762 AW411479 Hs348 FK50&4mdfcg protein 4 (59kD) 33 

404580 NH,014112^Homo sapiens Mctahinoptiab 33 

452046 AB018345 HsJZ7657 WAA08Q2 protein 33 
4595B7 AA031956 gbzk15e04.s1 Soares_pregjiantjutenisJ^a9 

55 416658 U03272 HsJ9432 n^2(conp^contrachvalaia 19 

426G47 AA243464 HsJ294101 pre-&cel leukemia transection fador 3.9 

429353 AL117406 Hs^001Q2 ATP-Wwflng cassette transporter MRP8 33 

419038 AW134324 Hs.190325 ESTs 34 

418918 X07871 Hs39476 CD2 ara^en (p50), sheep red blood eel 33 

60 421977 W94197 Hs.110165 ftosoma) protein 126 homotog 33 

442567 AI201183 Hs.130251 ESTs 19 

421168 AF182Z77 Hs330780 cytodmm)eP450,subfa^nB(pr)emi}ar 33 
431701 AW935490 Hs.14658 Human crirarwsome 5qttt clone 5G8 mRNA33 

418526 BE019Q20 Hs35838 sotute canter tarnfly 16 (rroriocaT^ 33 

65 414998 NKL002543HS.77729 <ni(fisedkwdens^^proteh(lec&i 33 

422790 AAffi)9875 H&25933 ESTs 35 

419741 NM_007019Hs330G2 uokjuftto (arrterprotem E2-C 33 
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430017 AA263172 Hs.35 prote«tyro^ phosphatase, norwecept 3.9 

458814 AW98957 Hs.170861 ESTs, Weakly similarto Z195_HUMAN ZWC 3.8 

428514 AW236861 Hs.199139 ESTs 3.8 

434521 NM.002267Hs.3886 karyophertn alpha 3 {Importin alpha 4) 33 

5 409425 IM0462 Hs34452 zinc finger protein, sttemttyl A. 1(Bc 33 

439500 BB65647 Hs.74899 hypoftefcaJ proteh FU12820 33 

424028 AF055084 Hs.153692 Homo sapiens cDNAFU14354fe,cioneY7 33 

400021 AFRCconfaol-HUMlS6F3AM97935JM 33 

453403 BE466639 Hsl61779 HomosapfenscDNARJ13591fe,donea 3.8 

10 445941 AI267371 Hs.172636 ESTs 33 

434378 AA631739 H&335440 EST 33 

429220 AW2072D6 Hs.136319 ESTs 33 

439176 AM46444 Hs.190394 ESTs,WeaHysim8artoB28096Bne-1 pr 33 

401045 C11001883*:gi|6753278tidlNP_033938.1|c 18 

15 430178 AW449612 Hs.152475 ESTs 33 

423397 NKL001838HS.1652 chenioWr»e (OC mofi) receptor 7 33 

447630 AI660149 Hs.44865 fymptotteflhaficer^^^ 33 

436391 AJ227892 Hs.146274 ESTs 3.8 

413011 AW068115 Hs321 blglycan 33 

20 422121 AI767949 Hs. 179833 ESTs 33 

452268 NH_003512HsJ28777 H2Ahfctorofafiuty» member L 33 

427811 M81057 Hs.180884 cartraxypepfidase B1 (tissue) 33 

415579 AA1G5232 H&222069 ESTs 33 
437330 AL353944 Hs50115 Homo sapiens mRNA; cDNA DKFZp761 J11 12 (f 33 

25 427122 AW057736 Hs.323910 HB*2 receptor tyrostoeHnase((Hsn>*2, 37 

400286 NA C160K)922gip499103|pir|JT2Q903 hypothe 3.7 

420281 AI623693 Hs.191533 ESTs 37 

413926 AW90Q992 HsS3796 DKFZP586D2223 protein 3.7 

417541 AI992191 Hs.180040 typofoefcd protein RJ22439 37 

30 426172 AA371307 Hs.125056 ESTs 37 

429638 A1916662 H&211577 tinecfin 1 (tinesto receptor) 37 

457001 J03258 HsJ2062 v3arTibD(1^o*yoW^arnlnD3)re 37 



424109 AW406878 gbAJIW-BU^dg^OrOWUJlrl NIH_MGC_373.7 



417022 NMJr14737Hs3Q905 Rasasstxiatton(RaK3DS^^)doma«farn 3.7 

35 436222 AI208737 Hs.122810 H<BTOsaptenscDNAFLJ11489fis, clone HE 17 

430448 A1633553 Hs.13303 Homo sapiens cONA: RJ217B4 fe, clone H 3.7 

432729 AKD00292 H&278732 hypofte^ protein FU20285 17 

413916 N49813 Hs75615 apoBpoprote&i C-fl 3.7 

421662 NM.014141HS.106552 ceflrecogrt2onrr«JeculeCaspr2 37 

40 441633 AW958544 Hs. 112242 normal mucosa of esophagus specie 1 17 

408761 AA057264 HsJ88936 ESTs,WeaMysirrtfarto(de^r»tava 37 

406153 Target Exon 17 

445563 AW873606 Hs.149008 ESTs 17 

453464 A1884911 H&32989 receptor (caJcbinin)acBVfynioo^ 17 

45 448918 AB011152 Hsl22572 KIAA0580 protein 17 

413936 AF113676 H&297681 sertoe (or cysteine) proteinase Inhtoto 16 

448069 U76248 H&20491 seveninabserrfla{DTOsophea)hor^2 16 

453313 BE005771 Hs.153746 rtypofteU proteh RJ22490 16 

425234 AW152225 Hs.165909 ESTs, WeaWysbnterto 138022 hypoflieS 16 

50 410941 X98654 H&93837 phosphafelyflnositDltra^ 16 

402397 AF188625 Hs.189507 phospho4>aseA2> group (ID 16 

430378 Z29572 Hs3556 tumor necrosis fector receptor superfami 16 

448106 AI800470 Hs.171941 ESTs 16 

428431 NM-00O458HS.169853 lranscr|p8ontactor^hepfi^c;Lf^B3; 16 

55 431843 AA516420 Hs.183526 ESTs, WeaWystoater to 138022 hypotea 33 
426878 BE069341 a>«V3-BT0381^0<MI7l<^BT0381 Homo33 

434C81 AW024973 Hs£83673 NPD009protem 16 

445292 AV653264 Hs.13982 Hon» sapiens cDNARJ14666fis,ctor»OT 16 

452101 TG0298 Ks.10844 Homo sapiens cDKA FU14476fe, done MA 16 

60 427581 NMJH4788HS.179703 WAA0129 gene product 16 

409047 AW961434 Hs31539 ESTs 16 

416820 NMJQ00402Hs3Q206 gtucose^-prtosphateo^hydrogenase 33 

410386 W26187 H&3327 Honw sapiens cDNA: BJ22219 ketone H 16 

440516 S42303 Hs.161 cadherin2, type 1,f+<adhefti (neuronal 16 

65 434360 AW015415 Hs.127780 ESTs 16 

428970 BE276891 Hs.194691 refoofc acid induced 3 16 

415079 R43179 Hs.22895 hypothetical protein RJ23548 33 



328 



WO 02/059377 



PCT/US02/02242 



409819 AKD01015 Hs55220 BCL2^ssociated alhanogene 2 3.6 

430044 AA464510 Hs.152812 ESTs 3.6 

430829 AW451999 Hs.194024 ESTs 3.6 

434224 AA380731 Hs.84 toterfeukin 2 receptor, gamma (severe co 35 

5 439247 AFD88020 Hs.46767 EST 3.6 

431542 H63O10 Hs5740 ESTs 35 

430713 AA3S1647 H&2642 etferyofc translation elongation factor 15 

434988 A1418055 Hs.1 61160 ESTs 35 

437748 AF234882 H&5814 suppression of Hira»fearftty 7 35 

10 418322 AA284166 HS.84113 cydb^epeno^ kinase IrM^ 3 35 

439569 AW6Q2166 Hs£22399 CEGP1 probfc 35 
439583 AB07673 gfciLflT152-O8039WK)4 BT152 Homo sapfen3.5 

403212 NMJ)19595itomo sapiens totersecfln 2 (IT 35 

409C99 AK000725 Hs5G579 hypdnetelprDtebFU20718 35 
15 453968 AA847843 Hs.62711 Homo sapiens, done IMAGE3351295, mRNA35 

436338 W92147 Hs.1 18394 ESTs 35 

422890 Z43784 Hs.75693 ankyrin 3, node of Ranvter (ankyrin G) 35 
442295 AI8Z7248 Hs£24398 Homo sapiens cDNAFUl 1469 fis, done HE 35 

417975 AA641836 H&30085 hypoftefcal prolyl FLJ23186 35 

20 433730 AKD02135 Hs3542 hypc»ie8ca)prDtehRJ11273 ■ 35 

439926 AW014875 Hs.137007 ESTs 35 

445873 AA25097O Hs251946 pcirytAHAwfag protein, cytopbsmic 1-1 3.4 

410153 BE311926 Hs.1 5830 hypoftettcal protein RJ12691 3.4 
422128 AW881145 gbtfVO<JT0O33<)1 0400-182^ OT0033 Homo 3.4 

25 414921 BE390551 HsJ7628 sferddogenfc acute n^^^ 3.4 

441134 W2S092 Hs.7678 cellular le^ackMM^protehl 3.4 

444564 AI167877 Hs.143716 ESTs 14 

402470 NA Target Exon 14 

418120 AA213437 Hs.192249 ESTs 14 

30 422414 AW875237 Hs.13701 ESTs 3.4 

433345 AK&1545 Hs.152982 hypoftefeaJ protein FU13117 14 

409213 U61412 Hs51133 PTKBpnrteto tyrosine kinase 6 14 

422611 AA158177 Hs.1 18722 tocreyifransferase 8 (alpha (1,6) faoosy 14 

423554 M90516 Hs.1674 gtotarrdne^rudose^^^ 14 

35 442619 AA447492 Hs30183 ESTs, Weakr/slrrflaT to AF1647931 prate 14 
402359 NA 019001991^126^1 11bb|AAKD0751.1|AF2l4 

439398 AA284267 Hs£21504 ESTs 14 

415208 F01Q20 Hs.172004 fife 14 

452853 AA812633 Hs.10845 ESTs 14 

40 429345 R11141 Hs.199695 hypo&ie&al protein 14 

449027 AJ271216 H&22880 dipepfldylpepedase UI 3.4 

412115 AKD01763 Hs.73239 hypothetical piote h RJ10901 14 

432180 Y18418 Hs272S22 RuvB (E col riornotoo^fflte 1 3.4 

428977 AKD01404 Hs.194698 cycBnB2 3.4 

45 431611 U58766 rte364428 tissue specific transplants arisen 14 
418286 AA622528 H&319825 Homo sapiens, done 1MAGE3616574, mRNAA4 

436895 AFQ37335 Hs.5338 carbc^anhytfraseXIl fjun oranty enH 14 

443378 AW392550 HsSZBO p iuttMbu i w (prosome,macn^)sufacra, 14 

428450 NH_014791Hs,184339 KIAA0175gereprodud 13 

50 449571 AW016812 Hs^0Q266 ESTs 13 

412777 AJ335773 Hs270123 ESTs 13 

420542 NM_000505Hs.1321 coaguiato factor XII (Hagemanfedor) 13 

412754 AW160375 Ks.74565 amyloid beta (M) precursorOB protein 13 

418327 U78370 Hs.84136 r^reo^homeodbfnalnlrans^ 13 

55 449065 AI627393 H&2S8998 ESTs. WeaWysbrflajto WghmobJEygr 13 

425999 AW513051 Hs332981 ESTs, Wea«ysintfaTtoI38022riypom^ 13 

430280 AA361258 H&237B68 toterteukin 7 receptor 13 

407777 AA161071 HsJMBS Sfuatsneepoxttase 13 

426516 BE262660 Hs.170197 glutemlCHiQcateceectaan^ 13 

60 414361 AI086138 rta204044 ESTs 34 

427080 AW068287 Hs.173466 ras-related(^botultnumtoxhsubsfrate 13 

426429 X73114 H&169849 tnyosi»4ferftag protein C.stairtype 13 

446163 AAQ26880 HsJ25252 protacfin receptor 33 

428566 U41763 Hs.184916 dathrm, heavy polypepflcettB 1 13 

65 418641 BE243136 Hs56947 adlsMep^arrtrrriai^^ 33 

436293 AI601188 Hs.120910 ESTs 3,3 

411S7 AA628987 Hs.1 15274 ESTs, Highly similar to IHHjflJMAN H©IAN 13 
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430253 AKD01514 Hs236844 hypometlca!protekiRJ10652 33 

430066 AI929659 H&237825 signal recognBJonpartfcfe 72© 3.3 

436469 AKD01455 Hs£198 Do^ syndrome aflfcalreglwi gene 2 33 

437786 BE142681 Hs.155573 polyrr^asepNA directed), ete 33 

5 444079 K0904B Hs23606 ESTs 33 

457183 H91882 Hs.118569 DvWxntflng protein IDAX (InhibrBon of 33 

431215 AA496078 Hs.121554 HumanDNA sequence tarn done RP11-218C133 

424563 AA446932 Hs.151428 ret finger protein 2 33 

450828 AW270655 Hs.193804 ESTs 33 

10 408652 R43409 . Hs.6829 Homo sapiens mRNA for WAA1644 protein, 33 

445142 AW978484 Hs.93842 Homo s^)^ cDhlV FU22554 fte, done H 33 

426761 AI015709 Hs.172089 r^sapkramRNA;cOMDKF^^ 



439237 AW408158 Hs318893 ESTs, WeaMysimtotoA475B2B-ce«gr 33 

422616 BE300330 Hs.1 18725 seJeroriwsphate synthetase 2 33 

IS 443247 BB614387 Hs333893 oMyc target JPOI 33 

406663 U246B3 Hs3Q2063 taBmirapjobu&i heavy constant mu 33 

434137 AA907734 Hs.124895 ESTs 33 

408877 AA479033 Hs.130315 ESTs, WeatfyshdartDA47582&ceflgr 33 

439101 C01765 Hs38750 hypc4hetfcalprolektFU11526 33 

20 408221 AA912183 Hs.47447 ESTs 33 

447519 LM6258 Hs339665 ESTs 33 

404755 NA Target Exon 3.3 

451871 AI821005 Hs.118599 ESTs 32 

420319 AW40S289 HsJ6593 byprthe^proteto 3J2 



25 430580 AA806105 Hs300697 irrrniunogtobufin heavy constant gamma 3 (G 32 
400202 KA NM^002795**toniosar^rJotBasome{pros^ 
400222 NA NMjDa2082^Htm»sapiemGpj^^ 
425988 BB045B97 H&274454 ESTs, WeaWys^toB8022hypofliefi 32 
458098 BE55Q224 Hs.74170 metaWtowsi 1E (functors!) 32 
30 430589 AJ0O2744 Hs246315 UDP-H^cetyMpha^alactos^ 3J2 
431563 A1027643 Hs.120912 ESTs 32 
442353 BE379594 Hs.49136 ESTs, Moderately simflar to AW7J1 UMAN A 3^ 
422309 U79745 Hs.1 14924 sc4utecajn^forn8y16(nicnocaibax^ 32 
419703 AI793257 Hs.128151 ESTs 32 
35 420380 AA640891 Hs.102405 ESTs 32 
410853 K04588 Hs30469 ESTs 32 
454417 AI244459 Hs.1 10826 fainudec^ repeat caitafHng 9 32 
432745 A1821926 gbort78a)5j(5 NCLCGAPJW Homosapjensl2 

422032 AA476966 Hs.110857 r^rfymeiase (RNA) HI (DhW (firec^ 32 
40 415339 NM.015156HS.78398 WAA0071 proteh 32 
426384 AI472078 Hs303662 ESTs 32 
448030 N30714 Hs325960 rnernbrane-spannlna 4-danatas, subfamily A 32 
418739 AA310964 H&88012 SHP2 interacting transroerrferane adaptor 32 
442053 R35343 Hs£4968 HiJrnanDr^sequem»^cloiieRP1-5^1632 
45 434747 AA837085 Hs^205B5 ESTs 32 
427297 AW292593 Hs334907 Hornosap^doneMGC:t7333,rrtf^oona2 
412228 AW503785 HsJ37S2 coinpleinertconipaieitfpd^ 32 
452304 AAQ25386 Hs.61311 ESTs, WeaMy sMar to S10590 cysteine 32 
453953 AW408337 Hs36972 CD7arrfig8n(p41) 32 
. 50 407758 D50915 Hs38365 WAA0125 gene product 12 
451149 AL047586 Hs.10283 RNAbWrngrnoffl protein 88 32 
430015 AW768399 Hs.1 12157 ESTs 32 
433313 W20128 Hs296039 ESTs 32 
418334 AA319233 H&5521 ESTs 32 
55 450223 AA418204 Hs241493 naturaJWIter^imwreoogn^ serene 32 
454365 AW966728 H&54642 meWontoe adenosyttransierase tl beb 32 
451128 AL118668 gteDKFZp761l0310j1 761 (synonym: harrr/2)33 

417793 AW405434 Hs32575 small nuclear nbc^iBopii^pdypeDt 32 
428027 U22029 Hs334345 (7tMhronieP450,subfarn?yUA (phenobar 32 
60 441197 BE244636 Hs.166 sterol regulatory eternertbli^ 32 
424634 NM.003613HS.151407 carflao^hterrrec^ layer rxoteft 32 
419986 AI345455 Hs.78915 GA4)Jr^ r^rotelntranscft^ 32 

416714 AF283770 Hs.79630 CD79A antigen fimmimoglobulln^ssoclated 32 i 
449465 NAL004380Hsl23598 CREB binding protein (RuUnstetoTayU s 32 
65 422166 W72424 Hs.1 12405 S100 cak*unvbWing protehA9(ca|gran 32 
409079 W87707 Hs32065 IntBrteuWn 6 signal transducer (gp130, 32 
423551 AA327598 H&233785 ESTs 32 
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453553 AA036849 Hs.61829 Homo sapiens cDNA FU12763 Cs, clone NT 32 
442580 AJ733682 Hs.130239 ESTs 35 
458079 AI796870 H&54277 DNA segment on chromosome X (unique) 99232 
425700 AF075292 Hs,159251 Ibrttead box H1 32 
5 417124 BE122762 H&25338 ESTs 32 
407104 S57296 Ha323910 w*Vb2 avian ejytottastfcteukefnQV 12 
442215 AI703172 Hs.129005 ESTs, WeaHyshtfarto21Q926QABcefl 3.1 
430271 T06199 Hs237506 DnaJ (Hsp40) homoteg, subfamily B, membe 3.1 
425317 AW205118 Ha21Q546 WerteuMn 21 rector 3.1 
10 426095 A1278023 Hi89986 ESTs 31 
442313 BE388898 H&8215 hypoftefcal protein RJ1 1307 11 
424709 AL137589 Hs.152149 hypodte^ protein DKFZp434K0410 11 
429671 BE379335 Hs211594 r*oteasome (prosorne, 
432715 AA247152 Hs200483 ESTs» WeaWy sJntotoKlAA1074pfotem 3.1 
15 431574 AW572659 Hs261373 hypolhe8calprotBh(U4340143 3.1 
436876 AI124756 Hs^337 bodtrato dehydrogenase 2 (NADP), mtoc 3.1 
405017 NA TargetExon 3.1 

433805 M706910 Hs.112742 ESTs 11 
437352 AL353957 HsJ284181 rvpc*e8caIprotehDKFZp434P0531 11 
20 430105 X7D297 Ha2540 chcAieigferecspk^nlcofin^ 11 
422083 NM_001141Hs.111256 arachldcfiatB 15^poxygenase. secondlyp 3.1 
413507 BE145360 Hs.190064 ESTs, WeaMystmfer to 138022 hypofoeii 11 
4159© AC67700 H&317584 ESTs 11 
422907 AI879263 H&6986 Human glucose transporter pseudooene 11 
25 425548 AA890023 Hs.1906 prolactin receptor 11 
422999 BE387202 Hs.116638 rwwnetastafccefc 1, proteto (NM23A) 11 
439963 AW247529 Hs£793 ptafeteecfraflhg factor acetyhydrota 11 
453883 AI638516 H&22630 ctfactorrequkedlbrSpltran^ 11 
458021 AB85190 Hs.156089 ESTs.WeaJ^skitotorepresscfpfotBJ 11 
30 418478 U38945 Hs.1174 cycflrHJeperxfertldrK^ 11 
400814 NA TargetExon 11 

402327 Target Bam 11 

416935 AA190712 gbcq987fl)9j1Str8tag8neKeLacels393 11 

439838 AL355722 Hs.106875 Homo sapiens EST from done 35214, M 11 
35 437038 AI571514 Hs.133022 ESTs 31 
449523 NM_C00579Hi54443 d»nio)dne(CCnioffi)iecep1or5 11 
406642 AJ245210 gb^icmsa^0iRNAtetnirmirx#^ 11 

406624 AF052762 gfcHorno sapiens done csneg8-1 Immunogb 11 

421924 BB14514 Hs.108606 coronK acfintindlng protein, 1A 11 
40 414523 AU076633. Hs.76353 serine (or cysteine) proteinase inhfoto 11 
416379 N38857 Hs203933 ESTs 31 
422823 D89974 Hs.121102 vanh2 3*1 
433904 AK99956 Hs208956 ESTs 3*1 
Ae 421904 BE143533 Hs.109309 hypoftetkai proteh FU20035 11 
45 428834 AW899713 Hs^39315 ESTs 11 
436043 AW963838 Hs.168830 HccrosaplerecONAnj12136fis,don8MA 11 
452823 AB012124 H*30696 taiscrfrtai to-8ke5(basfche&c 11 
405381 NA Target Exon 11 

428746 AW503820 Hs.192861 St>B trarso ip Oon factor (SpWU.I r 3.1 
50 435147 AL133731 Hs.4774 Homo sapiens mRNA; cDNA DKFZp761C1712 (G.1 
425782 U66468 Hs.159525 celgiow&iregiila^wiftEFJia^ 11 
423306 W88562 Ks.108198 ESTs 11 
419123 AA234276 H&88253 ESTs 31 
438581 AW977766 Hs292133 ESTs, f^xJerate^slmDar to r78885 serin 11 
55 417105 X60992 H&81226 COS antigen 10 
428361 NM.O15905H5.183858 fransaipfjunar bitermedfary factor 1 3.0 
41780) BE241595 H&82B48 setecSnLflyiTTtoc^ 10 
402606 r^024^itosap^rrypc4reS(^r^ 3.0 

401451 NKL004496 i *terno sap ien s hepatocytenuctelO 

60 421878 AA299652 Hs.111496 Horn saplemcDNARJ116431^ done rE 10 
409518 BE384836 Hs3454 K1AA1821 proteto 10 
416933 BE561850 H&8Q5Q8 sntf nuclear rfborud^^ 3.0 
414324 Y14768 Hs.890 rymprrioorin b& 10 
425081 X74794 Hs.154443 mWaomosome maintenance defcient (a 3D 
65 401519 C15000476^gI|12737279^etlXP_01216l1| 10 

411704 AM99220 Hs.71573 tiypothetol proteto HJ10074 10 
428819 AL135623 Hs.193914 WAA0575 gene product 10 

331 



WO 02/059377 



PCT/US02/02242 



428423 AU076517 Hs.184276 sohite earner family 9 (sodkim/hydro9^ 3.0 

413835 AI272727 H&249163 acid hydroxylase 3.0 

412600 L28824 Ha.74101 spleen tyrostoettiase 3.0 

410491 AA4G5131 Hs.64001 HarosapfensctoftB 25218 mRNA sequence 3.0 

5 433658 L03678 Hs.156110 bwmmogtefaufei k^pa constart 3.0 

427666 AI791495 Hs.180142 cabnnhiSn^s^pfOteh 3.0 

4525U AI904898 gtxRC-BT068-130399485 BTD58 Homo sapfen3.0 

429500 X78565 H&289114 hexabfachfcn (temscin C, cytotactin) 3.0 

432485 N90866 HsJ76770 CDW52 antigen (CAMPA7H-1 anfigen) 3D 

10 437400 AB011542, Hs^599 EGF-Ste-ckxnaln, muHple 5 3.0 

452234 AW084176 Hs^2329S ESTs, WeaWysinnBartol38022hypothea 3.0 



413269 BE167526 gb^M4-Hn)50W)80304>-107-g07 HT0509 Homo 3.0 

453216 AL137566 Hs32405 Homo salens mfftlA; cDNA DKFZp586G0321 (f 3,0 
400929 ENSP00000252232^Steidreg^ 3.0 



15 445145 AI951702 Hs.147434 ESTs 3i) 

432615 AA557191 Hs^5Q28 ESTs, WeaWy simDar to 154374 gene NF2 3.0 

423279 AW959861 H&290943 ESTs 3.0 

429392 AL109712 H&296506 Hoino sapiens mRNA full tengfo Insert cDN 3.0 

408548 AA055449 Hs.63187 ESTs, Weakly similar to ALUC_HUMAN till 3.0 

20 451346 NM_006338Hs26312 gfioraainpriedcfl chroma 3.0 

413109 AW389845 Hs.1 10855 ESTs 3.0 
401714 NA ENa 3 00000241802*^)DNA FU1 1007 FIS, aON 3.0 

421462 AP016495 Hs.104624 aquaporfn9 3.0 

421750 AKD00768 Hs.107872 rrypofoefcal protein FU20761 3.0 

25 453293 AA382267 Hs.10653 ESTs 3.0 

457085 AA412446 HsS8138 ESTs 3.0 

438930 AW843633 H&306163 hypothetical protein AL1 10115 3.0 



* 
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TABLE 22A 



10 



15 



20 



25 



30 



35 



Table 22A shows the accession numbers for those pkeys lacking unigenelD's for Table 22. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset Werrtffier number 

CAT number Gene duster number 
Accession: Genbank accession numbeis 

Pkey CAT number Accessions 

407980 103087 J AAQ46309AE63500AA046337 

410785 1221055J AWB03341 AW803265 AW803403 AW803466 AW8034Q2 AW803413 AW80326B AW803396 AW803334 AW803355 

411743 1256038J AW862214AW859811 AW862215 

412138 1Z79172J AW895387AW895547AW895564 AW895323AW895405AW895539AW895538 

413259 1356981J BE167526 BE167651 BBJ76401 R24654 

416935 153179J AA190712AA190665AA252564 

422128 211994J AW881 145 AM90718 M85637 AA304575 T06067 AA331991 

423945 233566 1 AA410943 AW948953 AA334202 AA332882 

424109 235506J AW406878 AW966560 AW966151 AW966498 AA336174 AA335376 AA335537 

424128 235728J AW966163AA335883 AA336011 AA335668 AA335973 

425331 250199.1 AW962128 AA355353 AA427383 

426878 273265 J BB068341 AW748403AL044891 AI908240 AA393Q80 

43Z745 353673.1 AI821928AA658826AA564492AA635129AI791191 

441153 51081.2 BB6282B BE378727 

448212 755099.1 AW75B58AW969013 

451128 859865J AL118668 078823 AI762176 

452514 920172,1 AB04898AB04849AJ904899 

456207 165078.-1 AA193450 
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TABLE 22B 



Table 22B shows the; genomic positioning for those pkeys lacktog unigene ID's and 
5 accession numbers in Table 22. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos probeset 

10 Refc Sequence souixl The7d^miiitoerelnthiscc!u™ numbers. "Dunham I.etaL refers to the pubBcation 

enffiledTTieDNA sequence tftoimanchto^^ Dunham I. et aL, Nature (1999) 402:489495. 
Strand: ImficatssDNA strand ftaro which axons were piedicted. 

NLposfBon: toaTcatesmideo^posfftra 

15 



20 



25 



30 



35 



40 



45 



Pkey Ref 


Strand 


NLposftion 


400814 8569925 


Minus 


7284072924,74761-74849 


400929 7651921 


Minus 


122033-122241,123483-124028 


401045 8117619 


Plus 


90044-90184.91111-91345 


401451 6634068 


Minus 


119926-121272 


401519 6649315 


Plus 


157315-157950 


401645 7657839 


Minus 


34986-35133 


401714 6715702 


PHs 


96484-96681 


401866 8018106 


Plus 


73126-73623 


402327 7656695 


Minus 


108675-108770,109801-109910 


402359 9211204 


bfinus 


4040341961 


402408 9796239 


Minus 


110326-110491 


402470 9797107 


Plus 


195129-185776 


402542 9801558 


Minus 


67076-67594 * 


402578 9884928 


Plus 


66350-66496 


402606 9909429 


Minus 


81747-82094 


403011 6693597 


hflnus 


3468-3623 


403212 7630897 


Minus 


156037-156210 


403329 8516120 


Plus 


9645006598 


403366 8783692 


Minus 


4932349652 


403465 9966528 


Plus 


28884001.3198353&36554117 


404347 9B38195 


Pius 


7449374829 


404560 6539738 


Minus 


240588-241589 


404755 7706327 


Minus 


5372^53846 


405017 6532084 


Plus 


35551-35690 


405348 2914717 


Minus 


4331043462 


405381 6006921 


ftftus 


763&8054 


405801 2924321 


Ptus 


6346^3694 


405850 6164995 


Phis 


13871-14110 


406153 9928734 


Minus 


12902-13069 


406348 9255985 


hfinus 


7175471944 
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TABLE 23: 320 GENES DOWN-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT BREAST 

5 Table 23 shows 320 genes down-regulated in breast cancer compared to normal adult breast. 
These were selected as for Table 22, except that the numerator was set to the median value 
for 12 non-malignant breast specimens, the denominator was set to the median value amongst 
the 73 breast cancers, the 90* percentile value amongst thel2 non-malignant breast 
specimens was greater than or equal 80 units, and the ratio was greater than or equal to 4. 0 
10 (Le. 4-fold down-regulated in tumor vs. normal breast). 



Ptey. Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession number, Genfaank accession number 

15 UnjgenelD: Un^ene number 

UnjgeneTtte: Un&enegenetffie 

R1: Rafb of 50* percentile norn^bo^ 



20 Ptey ExAccn UnfeeneD UnfeeneTffle Ratio 

428722 U76456 Hs.190787 tissue Inhibrtor of metaJtoprotBtnase4 22.4 

428848 NMJJ00230 Hs.194236 teptin (murine ctestyhwnobg) 17.4 

445263 H57B46 Hs.42586 IOAA1560 proten 15.4 

25 418935 T28499 Hs39485 carbonic anhydrase IV 15.0 

407228 M25079 Hs.155376 hemoglobin, beta 143 

417511 AL049176 Hs.8??73 chordWB© 143 

439285 AL133916 Hs.172572 hypdhe^ protein RJ20093 143 

412442 AS83730 Hsl26530 seiumdeprhrato response (pho^ihafidyl 133 

30 410544 AW46543 Hs35511 ESTs 123 

412047 AA934589 Hs.49696 ESTs 1Z2 

422667 H25642 Hs.133471 ESTs 1Z0 

406664 134041 Hs3739 ojycen^hosphats dehydrogenase 1 (so 124 

423201 NMJD00163 Hs.125180 growth hoimone receptor 11.7 

35 422163 AFQZ7208 Hs.112360 prornrrdn (mouse>-&e 1 103 

428769 AW207175 Hs.106771 ESTs 103 

407049 X72632 NMJ521724*:Homo sapiens nudear receptor 10.1 

412295 AW088826 Hs.117176 poJyCAH^ng protein, nudear 1 9.8 

425126 N32759 Ks.172944 chorioi^o^rai^ 93 

40 406791 AB20684 H&272572 hemogloba%8Jpha2 05 

447471 AFQ39843 Hs.18676 sprouty (DrosophSa) homotog 2 93 

451533 NR004657 H&26530 serum deprivation response (phosphatidyl 9.4 

419407 AW410377 Hs.41502 hypdhetkapjote»FU21276 93 

411339 AB65585 Hs.146246 ESTs 9.0 

45 410532 T53088 Hs.155376 henwgtobm, beta 8.9 

425707 AF1154Q2 Ha. 11713 ETWte factor 5 (efedomamtranscnjrt 8.8 

416585 X54162 Hs.79386 lebmodh 1 (smooft muscle) 87 

443060 D78874 H&8944 procoQa^(>eridopepfidaseertfiancer2 83 

4352G5 AA779958 Hs.185932 ESTs 83 

50 422511 AUD76442 Hs.117938 coflag^ type XVU, alpha 1 8.4 

433138 AB029496 Hs39729 semaphorin sent2 83 

402195 ^004497^rtomosapienshefK^ 8.1 

429350 AT754634 Hs.131W ESTs 8.1 

445107 AI208121 Hs.147313 ESTs, Weakly srmflarto D8022 hypothefi 8.1 

55 406643 N77976 H&272572 hemcgk*Ha|pha2 8.0 

410199 AW377424 H&2D5126 Homo sapiens cDNA: FU22667 fis, dons H 8.0 

417225 AA815048 H&24078 rrypothe^ protein RJ1^49 73 

437569 AA760849 HsJ294052 ESTs 73 

436062 AKD0C027 Ks.96633 ESTs 73 

60 425078 NMJQ02599 Hs.154437 prosprKrfresterase 2A, cGMP-stimubted 73 

430327 AW973636 Hs.55931 ESTs 7.4 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



447577 
446039 
422060 
424455 
424343 
442792 
406714 
407571 
429580 
453500 
422233 
420205 
404368 
447261 
417090 
410677 
427283 
415011 
412)68 
416253 
435885 
402779 
418138 
439335 
427019 
411478 
452654 
447359 
414323 
441266 
417011 
400089 
433614 
440439 
454404 
438704 



419313 
409196 
410882 
453469 
441899 
426210 
413065 
454192 
425187 
429757 
420202 
416284 
428553 
404689 
438887 
406082 
449748 
431048 
452205 
434040 
407744 
450606 
414629 



A1393693 

A1150491 

R20893 

AA452006 

AW956360 

AB52340 

AI219304 

A1446183 

AA346839 

AI4784Z7 

AB002D58 

AA256395 

NA 

NNL006691 

AA193282 

WL0Q3278 

AL1197S6 

AW963085 

S72043 

BE250659 

AA701463 

NA 

AA213626 

AA742697 

AA001732 

BE143068 

BED04783 

NNL012093 

NNL014759 

H15968 

P08212 

NA 

W07475 

N92813 

BB067414 

AA062610 

NA 

AW451023 

AA843387 

NMJD01874 

AW809163 

AB014533 

AI372588 

AA372052 

BB063555 

AW876813 

AW014486 

AW452355 

AL036557 

AI695473 

AA181641 

NA 



S47833 

H23963 

R50253 

C15819 

AW444613 

AB020629 

AI668605 

AA345824 



436107 739079 
444432 A1161428 
434715 BBX)5346 
447205 BE617015 
408122 AI432652 



Hs.183297 

H&90756 

H&325823 

Hs533199 

Hs.4748 

Hs.131194 

H&283108 

Hs.9572 

H&209100 

Hs.43125 

Hs.1 13275 

H&88156 

Hs.17917 
Hs55863 
Hs.65424 
Hs.174185 

Hs.73133 
Hs.15463 
H&36341 

Hs.136204 

Hs.62492 

Hs.173233 



Hs.18268 
Hs534688 
H&293845 
H&234898 

H&277101 
Hs.64754 

Hs.148050 

Hs55848 
Hs57279 
H&334S73 

H&33010 

H&8022 

Hs534559 

Hs5343 

H&22S09 

H&256037 

H&95910 

H&298006 

Hs.184907 

H&265499 
H&62927 
Hs.32043 
H&249129 

H&288809 
H&38095 
Hs.60380 
Hs.76688 

Hs.191194 

Hs.75916 

H&116410 

Hs.11006 

Hs.42824 



DKFZP566F2124protem 7.4 

ESTs 7.2 

ESTs, Moderately ^milarb ALUS J^UMAN A 12 

ESTs 7.1 

adenylate cyclase activating polypeptide 7.1 

ESTs 7.0 

hemogtobh, gamma G 63 

ESTs, H^hly simflar to CYA5_HUMAN ADENY 65 

DKFZP434C171 protein 6.7 

esophageal cancer related gene 4 protein 6.7 

purinenjfc receptor P2X-fte 1, orphan r 6.7 

ESTs 65 

ENSRHXXX)241075*:TRRAP PROTBN. 6.6 

extrBceS^arbikcbrralrfCCfitatning 1 65 

ESTs, WeaWy sfrnfar to B34612 zinc fing 65 

tetranectin (ptasrrBTtogen-btntfing protein 65 

ectonucteofidB pyrophosphatase^ 65 
gbfST375158 MAGE resequences, MAGH HomoM 

rraSallofotor^ 3 (growth inh^^ 6.4 

Homo sapiens, done IMAGE2959994. mRNA 6.4 

ESTs 65 

Target Exon 65 

EST 65 

ESTs, WeaHy similar to B39066 proBne-r 65 

hypdhe1i(^proteltRJ1097O 62 
&M®WQ^SWmlmMm HT0158 Homo6.2 
gb:^-BN01 14-27l>MXM)04^11 BN0114 Homo6.1 

adenylate kinase 5 6.1 

KIAAQ273 gene product 6.1 

Homo sapiens, dcraBMGE3502329,mRhlA, 6.1 

ESTs, Weak^ strrflar to 2109260A B cell 6.0 

Ecs Control 65 

(^tochrcmeccocktosubunitlVlsoform 55 

ESTs, WeaHysiraTartopotenfei CDS (H 5.9 
gttdR4-OT0355^00-201-e05 BT0355 Homo 5.9 

EST * 5.9 

Target Exon 5.9 

hypcfltefca! proteto DKFZp7610132 55 

ESTs 55 i 

caibagpepfidaseM 55 
cJ>AlR4-ST01 18^1099^)12-a03 ST01 18 Homo 55 

K1AA0633 protein 55 

TU3A protein 55 i 

Kon»sapiBiiscDNAFU14465es l dbrieNE 55/ 
gbcCM1-8T0283^811 WJ33k»9 BT0283 Homo 55 ' 

plttsphcglyceratodehyc^Dgenase 5.7 

ESTs 57 

ESTs 5.7 

puta^ryrnphc^GOJGIsv^c^ 5.7 

ESTs 5.7 

G protein-coupled receptor 1 5.6 

Target Exon 5.6 

ESTs 55 

aderx^nioiiophosphatecteam 5.6 

ESTs 55 

ceHo^aflKMK^O^A^eo^dDTa 55 

gb:C15819 Ctontech human aorta polyA mRN 55 

hypoto^protetoFU20159 55 

ATP-btncfing cassette, suMarriy A (ABC1 55 

ESTs,ModefaierysirrflartoAL^^ 55 

carboxytesterase 1 (rnonocyteMwoophage 55 

C1100rj703xjr]1fJ048448Iref|NP^ g 55 

ESTs 55 

spfcargto3b,$ubwBt2,145kD 55 

ESTs 55 

ESTs,rvto(teiBterysiJiiartoT17372p^ 55 

hypofoetfccd protein FU10718 55 
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454016 


AW016w)6 


H&2331Q8 


ESTs 


5.5 


414913 


R25621 




gb.7M5»8j1 Soares placenta Nb2HP Homo 


SA 


459033 


AAD1758) 


ll_ 4«VWV7 

HS.1Z990/ 


ESTs 


5.4 


441003 


BE172240 


Hs.126379 


ESTs, WeaHy slmBar ta 138022 hypotefl 


5.4 


450637 


N49826 


Hs.18602 - 


ESTs 


5.4 


44239S 


AA994520 




gb:ou42g09.s1 Soaies lNFULGBCJSI Homo s5.4 


403612 


NA 




Target Exon 


53 


407102 


AA007629 


Hs.9739 


gtycefol-3-phosphate dehydrogenase 1 (so 


5.3 


410057 


R56634 


Hs.268107 


muEfimerin 


53 


428232 


BE272452 . 


Hs.183109 


monoamine oxidase A 


53 


432769 


AA620814 


Hs.144959 


ESTs 


53 


431344 


R99530 * 


HsJ272572 


hemoglobin, alpha 2 


53 


427032 


AF012023 


Hs.173274 


ii^rlncytopiasnfc domain-associated p 


53 


406305 


BE26132D 


Hs.158196 


transcriptional adaptor 3 (ADA3, ycasl h 


53 


437411 


AW613948 


Hs.194915 


ESTs 


53 


442800 


AI809481 


Hs.131227 


ESTs 


53 


402054 


NA 




TaigetExon 


53 


432085 


AF212829 


Hs^72406 


potasshim channel, aWamDyK, member 9 


53 


415313 


R59638 


Hs.6181 


ESTs 


^2 


459159 


AI904646 




gkQV-BT06WJ20399-1Q3 BT065 Honx) sapten 5\2 


427164 


AB037721 


Hs.173871 


WAA1300 protein 


52 


441391 


BE467930 


Hs.170381 


ESTs 


52 


458959 


A1285301 


Hs.181297 


ESTs 


52 


402698 


NA 




ENSP0(H)0Q251335^DJ1D03J2.1 (sodium and 


52 


401810 


NA 




TaigetExon 


52 


438879 


AA827674 


Hs.189073 


ESTs 


52 


414657 


AA424074 


Hs.76780 


proteh phosphalase 1, regulatory (Inhfb 


52 


427809 


M26380 


Hs.180878 


apoprotein lipase 


5.1 


456063 


NMJX6744 


Hs.76461 


refinoMxVtdlRg protein 4, LilnaSSal 


5.1 


451186 


AWQ23469 


Hs.65256 


ESTs, WeaWy slmSarto leutine-rfch gfi 


5.1 


451882 


AI821324 


Hs.100445 


ESTs 


5.1 


402583 


NA 




NMJ)216ahHorno sapiens PR domain contain 


-5.1 


431130 


NMJD06103 


H&2719 


HE4; WFDC2: putafoe ovarian caictooma m 


5.1 


458218 


AW35179 


Hs.126820 


ESTs 


5.1 


416083 


R53467 . 


H&269122 


ESTs, WeaWyslmBar to ALUIJiUMAN ALUS 


5.1 


455282 


BE143867 




gbM^fTO16447010(M)iaM2 HT0164 Homo5.1 


42648$ 


X03350 


Hs.4 


alcohol dehydrogenase 1B (dass % beta 


5.1 


426156 


BE244537 


Hs.167382 


natriure&c pepSde receptor A^juanyfate 


5.1 


407891 


AA486620 


Hs.41135 


80domudn-2 


5.0 


408610 


AW026692 


H&224829 


ESTs 


5.0 


445967 


D59597 


Hs.118821 


CG1-62 protein 


5.0 


434813 


A1524307 


Hs.162870 


ESTs 


5.0 


437526 


AKJ76012 


Hs.121388 


ESTs. Weaxty simDar to MDHC JiUMAN MALAT 5j0 


454775 


BE16Q229 




gb*W14fr041349020OO62-a12 HT0413 Homo5i) 


409451 


AF012626 


Hs54472 


fragile a rneniai re$araatton c 


5.0 


409853 


AW502327 




o^>lJW-BR0p4kB4074Uilj1 NtH_MGC_5 


5.0 


405062 






Target Ban 


5\0 


446490 


AKD00706 


Hs.15125 


hypothetical proteh RJ20O9 


5.0 


417622 


AW298163 


Hs32318 


WAS protein family, member 3 


5.0 


421978 


AJ243662 


Hs.110196 


NICE-1 protein 


5.0 


440338 


R62431 


Hs.12758 


ESTs 


5.0 


4154Z1 


R35009 


rraLZ'KHJo 


ESTs 


5.0 


417574 


R00348 




gkxye&9e06j1 Soares fetal Iter spteen 


5.0 


409882 


AJ243191 


H&56874 


heat shock 27H) protein family, metnber7 


5.0 


447S98 


AI768289 


H&304389 


ESTs 


4.9 


445613 


6E550889 


Hs.158491 


ESTs 


4.9 


443074 


AW341470 


Hs.144907 


ESTs 


4.9 


451324 


AI783600 


Hs^08Q52 


ESTs 


AS 


432433 


AW014734 


Hs.157969 


ESTs 


AS 


449654 


AB89812 


Hs.199850 


ESTs 


AS 


414519 


N94567 


Hs»55063 


ESTs 


AS 


457531 


AW973716 


Hs.13913 


WAA1577 protein 


AS 


433200 


AA68Z722 


Hs.192725 


ESTs 


AS 


430782 


AF026263 


H&247920 


choSnerglc receptor, muscarinic 5 


AS 


427555 


AW137094 


H&97990 


ESTs 


AS 


433545 


AA868510 


Hs.112496 


ESTs 


4.8 


420334 


AB49351 


Hs.118944 


bypofoetfcal protein FU22477 


43 
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421795 
427138 
413072 
443721 
408053 
427067 
442969 
426220 
414593 
426893 
434046 
401590 
457971 
427722 
443793 
407737 
441955 
441499 
447517 
403017 
450580 
404611 
414831 
459290 
444341 
408614 
449638 
434418 
447360 
419583 



X63094 
N77624 



451199 
438338 
433756 
423301 
417237 
439745 
424137 
449338 
434744 
407402 
443510 
415754 
415986 
457416 
418064 
437120 
453950 
401093 
436935 
457974 
428222 
442705 
437409 
458494 
410490 
416069 
438463 
444043 
438327 
457711 
400870 
441425 
416267 
407262 
444567 
403263 



AW450451 

AW139474 

AA843716 

A1Q25499 

AJ383475 

BE386764 

AA398716 

AW292618 

NA 

AW134679 

AKD00123 

AAO45290 

R49187 

AA972327 

AW298235 

AI382726 

N40087 

H58589 

M31158 

NMJJ01546 

AI1420Z7 

AL137698 

AW204277 

AF134707 

A075984 

F00312 

AB48455 

A1290653 

NML014861 

AW015933 

S67580 

H86385 

AL389981 

AA335769 

H73444 

N94835 

AF035303 

NM.012190 

AA169114 

Z43619 

BE142052 

BE387287 

AB56125 

AA156998 

AW206494 

AW842353 

AL133112 

AQ64634 

AL359599 

AB80906 

H03569 

R37101 

AA807958 

A1499723 

K87407 

AF147401 

AA933590 

H45384 

M12873 

AV654Q20 

NA 



Hs383822 
Hs.173717 

H&266355 
H&246862 
Hs.177927 
Hs.132238 
Hs.171697 

Hs,97418 
Hs.113011 

Hs^42849 
Hs.180479 
Ha25830 

Hs.142903 
Hs.101689 
Hs.182434 

Hs.15248 

H&35156 

Hs.77439 

Ha34853 

Ha.146650 

Hs.46531 

Hsl250723 

Hi278679 

Hi167216 

Hs.147492 

Hs.124758 

Hs.6168 

Hs.112654 

Hs.1645 

H&81737 

Hs.149219 

Hs.16262 

Hs394 

H&2B382& 

H&9520 
H&12247 

Hs.62654 
Hs33384 
Hs.157767 
H&211568 

Hs353560 
Hs321717 
Hs.183085 
Hs.131127 
Hs283850 
Hs.158436 

H&20982 
H3314232 
Hs.135089 
Hs.172944 
H&23917 

H&28937 



Hs.184261 



Rhesus blood group, D antigen 45 

phosphaftfic acid phosphatase type 2B 4.8 
gbcQV»TQ29&-14Q20aOB54i01 BT0296 Homo 43 

ESTs 4.8 

ESTs 43 

ESTs 4.7 

ESTs 4.7 

ESTs, Weafysirrdlar to T13924s& prate 4.7 
gfc601273249F1 NiH_MGC_20 Homo sapiens c 4 J 

ESTs 4.7 

ESTs 4.7 

Target Exon 4.7 

ESTs 4.7 

hypo^eecal protein HJ201 16 4.6 

ESTs, WeaJdy slrralar to 2109260A B cefl 4.6 

ESTs 4.6 

ESTs 4j6 

ESTs 4.6 

ESTs 4.6 

Target Exon 45 

ESTs 4.6 

Homo sapiens cDNA FU 11027 6s, clone PL 4.6 

(xotinkbas^c^-depeno^re^ 4.6 

nhatoofDrMbtrto^4.dontiartneg 4.6 

ESTs 43 
Homo sapiens mRNA; cDNA DKFZp434C1915 943 

hypothefcal protein MGC2747 4.6 

a cDsmtegrin and rnetaSoprotetnase doma 45 

ESTs 43 

gfcHSBBODIOI STKATAGB4E Human skeletal ro45 

Homo sapiens cDNAFU11777fis, done HE 4j6 

ESTs 4.6 

K1AA07O3 gene product 4.6 

Honxsap^dcm8MG(^64 > m^comp 4.5 

cytochrome P450Lsub^^ 45 

palrn&Dy)-pfoteinflik)esterase2 45 

Homo sapiens mRNA fuBlengft insert cDN 45 

ESTs 45 

adrenomedullbi 45 

Homo sapiens genomic DNA, chromosome 21 q 45 

p>d4c4Ttosarienscbne2^43rrtf^sequem; 45 

rouiry^ahyctoofolata dehydrogenase 45 

hypothetical protein FU11413 45 

gbcHSC1GE121 notmafized Infent brain cDN 45 

knTigJe-OTrtainlng transmei Jtbrane protein 45 

S1Q0cak*jn)^)ir^prote^ 4.4 
ESTs. WeaWys!mfl8rtoHXA2JUJMAN HOME04.4 

euXarjfutic translation WUaBua factor 4.4 

C12000586^gq6330167|oipAA86477.1|(A 4.4 

ESTs 4.4 

ESTs,WeaHysirftnartoS2Z7Kteterogen 4.4 
r^sapwrrf^ctmra<FZp434K0^(fr4^ 

ESTs 4.4 
Homo sapiens mRNA; cDNA DKFZp547C126 (fir 4.4 

ESTs 4.4 

gb^|42d03j1 Scares piacerfeNb2rPHc^ 4.4 

ESTs 4.4 

ESTs 4.4 

ESTs 4.4 

chononfcgcoiadotopirHbetep 4.4 

ESTs 43 

C11000905gi)1169K65)gb|AA639879.1lAF28 45 

homecd»xprc^6iomALS0526 43 

gtKyn99c10j1 Soafes adu&brahN2b5HB5 45 

gtxHuman \g rearranged H-chain mRNA VDJ4 43 

ESTs,WeaHysintotoT26686hypomefl 43 

Target Exon 43 
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410034 BED67414 
456804 AW21645 
448427 BE3952G0 
416931 D45371 
421296 NM.002666 
400973 NA 
452602 AW366194 
412330 NML005100 
405016 

435104 AW75671 
406118 

418556 TQ2850 
429745 AA480818 
433088 AW451206 
444445 AA342329 
453880 AI803166 
447384 A077221 
414541 BE293116 
444975 AV652165 
403921 NA 
451477 AF798425 
406344 

416970 AA1912D1 
413662 BE155866 
458504 AW070634 
404682 NA 
418089 N89913 
403433 

446532 AW975460 
414217 AB09298 
418425 AB71247 
4 19589 AW973708 
457029 AA397789 
447860 AF193807 
448988 Y09763 
440610 AI733098 
439690 AF086410 
427240 AA399975 
408932 AW594172 
436112 T77545 
444382 AI144152 
456716 AA318060 
419846 NMJT15977 
410036 R57171 
400545 NA 
403051 NA 
420139 NM0Q5357 
450244 AA007534 
453261 AA034116 
440246 W52010 
414516 ABO7802 
438232 AI150595 
410233 AA082947 
412179 BE270758 
441871 ABG6150 
426411 AKD00708 
453692 AL110416 
448640 AW817177 
417481 AA203281 
412912 AW118678 
454183 AW807116 
426328 AW631296 
435942 R06285 
417629 T76945 
403593 NA 
402690 

416190 R49591 



Hs.139851 
Hs3Q9438 
Hs30485 
Hs.103253 

H355962 
Ks.788 

H&88607 



Hs£21736 

Ks.115899 

Hs.1 15920 

Ha2S462 

Hs.40528 

Hs76392 

Ks.182482 

Hs.42710 

Ks35861 
HsJ25522 
Hs.144794 

Hs£858 

Hs143563 

HS779898 

H&6262 

H&201925 

Hs.161803 

Hs.131835 

H&22785 

Hs.130800 

H&274151 

Ha278513 

Hs.187559 

H&58246 

Hs.135121 

Hsi85681 

Hs57975 



HsJS351 

Hs.125062 

Hs.118494 

Hs.191379 

Hs.135560 

Hs.122226 

Ks.69428 

Hs.153450 

Hs.169764 

Hs.102558 

Hsl2179B 

Hs.110835 



Hs.191215 



Hs^70425 



gbWR4^T0355-200100-201-€05 BT0355 Homo43 



caveoBn2 
EST 

appose most abundant gene transcript 1 



ENSP00tXK)236S67^Mudn5B {Fragment}. 
ESTs 

A kinase (PRKA) anchor protein (gravm) 

CY0^171^p280405|gb|AAF86402.1|AF245 

ESTs, HIgWy sfrnBar to F-bc* protein FB 

ENSP000Q0246632SONA FU2Q261fis, clone 

gfcFB12A9 Fetal brab, Stratagene Homo s 

ESTs 

ESTs 

Homo sapiens cDNA: FU22816 fts, clone K 
ESTs, WeaWy sirnflar to 138022 hypothefi 
ESTs 

aldehyde dehydrogenase 1 family, member 
ESTs,WeaWysrMartoTTK^hypoJhetJ 
C5000212^g510047237|dbpAB13407.1| (A 
ESTs 

C5001660^qi1811537|d&3BAB18935.1| (AB 
DKFZP586E1621 protein 
WAA1806 protein 
ESTs 

C90CM188^12738842tiefp^Pjy7372S.1| p 
ESTs. WeaHy sfmferto 178885 serihe/Eh 
NM.001622itemo sapiens arpha-2-HS-glycop 
ESTs 

Homo sapiens cDNA: FU23165 fis, don© L 

hypothetical protein MGC8407 

Homo sapiens CDNAFU13446&, done PL 

ESTs 

Rhesus blood group, B glycoprotein 
gamma-aminobulyric add (GABA) A recepto 
ESTs 

gtxBomo sapiens ftiB length Insert cDNA 



TP53TG3 protein 
ESTs 
ESTs 

hypothetical protein FU22415 
WQQams-Beiffen syndrome chrornosome regi 
cabequestnn 2 (cardiac rnusde) 
Target Exon 
Target Exon 

Spase, horrnono^sensrfive 
ESTs 
ESTs 
ESTs 

ESTs, Weakly stmttar to T43458 hypofoefj 
ESTs 

gfczn10g07.s1 Stratagene hNT neuron (937 
hypoteSca) protein MGC3020 
ESTs, Weakly sMferto 1909123A Na giuc 
hypothetical proteto FU207O1 
o^KF2i)434K0431 jl 434 (synonym: htes3) 
Hc^sap^c^h^3^n«NA,a)mp 
ESTs 
ESTs 

gWi/R4-ST0062^OH)2^12 ST0062 Homo 4.1 
gbmh83c09.y1 NCLCGAPJ5U1 Homo sapiens 4.1 
ESTs 4.1 
Qjxyc92c07j1 Soarestmantbratn 1NB H 4.1 
Target Exon 4D 
Target Exon 4.0 
ESTs 4.0 



43 
43 
43 
43 
43 
4.3 
43 
43 
43 
43 
43 
43 
43 
43 

i 4 - 3 
42 

42 

42 

42 

42 

42 

42 

42 

42 

42 

42 

42 

42 

42 

42 

42 

42 

42 

42 

42 

42 

42 

42 

42 

42 

42 

42 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 
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408641 AW245207 Hs3555 rrypofoetical protein MGC5347 4.0 

427899 AA829286 Hs332053 serum amyloid A1 4.0 

445975 AB11536 Hs.145734 ESTs 

438831 BE263273 HsJSA39 synapsfoU 4.0 

455578 BE008350 Hs.14355 Homo sapiens cDNAFU13207fls, done NT 4.0 

401840 NA Target Exon 4.0 

413753 U17760 Hs.75517 lamWn, beta 3 {rtein (125kD), kaBnin 4.0 

445030 A12D5925 Ks.147238 ESTs, Hlghry straSar to AAC3JKJMAN ALPHA 4.0 

433873 AW156913 Hs.150478 ESTs, Wea^simtertoAChaihA,C»yst 4.0 

456736 AW248217 Hs.1619 achaetB-scutB complex (Drosophte) homol 4.0 

450112 BEM7734 Hs5473 ESTs, Moa^tely sfriBar to AUBJtUMAN A 4X1 

448906 AI5895G7 Hs3Q9719 ESTs 4.0 
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Table 23A shows the accession numbers for those pkeys lacking unigeneEVs for Table 23. 
For each piobeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Ptey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number Accessions 

409853 1156226 1 AW5G2327 AW5Q2488 AW501829 AW502625 AW502687 

410034 1170594 1 BE087414 BE067958 BE067419 BEDS7963 AW577127 AW601412 

410233 118656 1 AA082947 AA083036 

410490 1205347 J H03589AW750687 AW750688 

410882 1225686 1 AW809163 AW809247 AW809177 AW809190 AW8Q9225 

411478 1247073L1 BE143068 AW849143 AW848705 AW848569 AWB48071 AW84S475 AW848092 AW846005 
413065 1347960J BHJ63555 BE151321 BE151319 BE151657 BE151655 BB063556 BE151322 

413072 1348163 1 BB083965 BE063968 BE064034 BE064028 BE063874 BE063966 BH163869 BED64043 BE()64a33 BED63884 

414593 1464909J BE386764 BE387560 

414913 15067211 R25621 C03959 C04010 

415011 151328J AW963085 AA159005 AW963073 

415986 156441QL1 Z43619 R61Z74 H12206 R12883 

416267 1583547J H45384 H49125 H41699 

417574 1687770 J R00348 R09593 

417629 169Q392J T76945 R2Q210 R05755 

418556 1767866_-1 T02850 

419583 186198 1 F00312 AA247490 F31427 AA383663 F22045 

426328 264901J AW631296 AA375484 

439590 47413 1 AF086410W94386 W74609 

442398 541Z71J AA994520AW393574 

452205 90415J C15819 AA024741 AA024742 

452654 925931.1 BE0047B3 BE 004947 AI911790 

454183 104^ 1 1 A^lTs AV^TO AW80741 5 AW807338 AW807288 AW807263 AW8073 1 6 AW1 77402 A W8074 1 3 AW807058 3>1 
" BE141569AW807401 AW807310BE141565 AW807318 AW807119AW807299 AW^41 AVV807225AW^^A)J^m5 
AW807103 BE141615AW807431 AW807393 AW807337 AW807406 AW807259 AW807375 AW845890 AWB07220 AW807399 
AW8O7064 AW807376 AW807G24 BE141595 AWB07236 AW807027 AW8071 12 AW807378 AW8072Q2 BE141S3AV^16 
AW807138 AW807244 AW807221 AW807297 AW807050 AW807248 AW807404 AW807075 AW807237 AW80721 2 AW807308 
AW807110 AW807104 BE140912 AW807301 AW807382 AW807294 AW807026 AW807020 AW807108 AW807025 AW8074^ 
AW807124 AW8O7419AW807031 AW807264 AW807032 AW807029 AW807052 AW807391 AW807207 AW807215 AW807019 
AW807238 AW807201 BE141590 AW8073Q2 AW807323 AW807380 AW807109 BE141588 AW845877 AW807418 AW807407 
AW807309 BE141614 AW845861 AW807396 AW807300 AW807348 AW807311 AW807214 AW807132 AW807402 AW807350 
AW807Q28 AW807298 AW807291 AW8073Q5 AW807217 AW807312 AW807261 AW807268 AW807260 AW807266 AW807198 
AW807346 AW807315 BE140940 AW807351 AW807122 AW807235 AW807076 AW807314 AW807213 AW807143 AW807131 
A W800079 AW807066 AW807077 AW845923 AW807421 BE141619BE140943 AW807420 AW807381 BE141621 AWM5921 
BE141629 BE141625 BE141624 BE141636 BE141630 AW807405 AWB07290 AW807353 

454404 1170594J BE067414 BE067958 BE067419 BE067963 AW577127 AW601412 

454775 1234106J BE160229 AW819879 AW820179 AW819882 AW819876 AW820169 BE153201 AW993736 BE152911 
455282 1273Q20J BE143867 AW935060 AW886684 
459159 919998J AO04646 BE179494 BE179421 
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TABLE 23B 



Table 23B shows the genomic positioning for those pkeys lacking unigene CD's and 
5 accession numbers in Table 23. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 
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Ptey: 
Ret 

Strand: 
NLposftbn: 



Unique number corresponding to an Eos probeset 

Sequence source. Tne 7 numbers In thbcohimn are Genbank Identifier (Gl) numbers. ThinhamLetaL'retotothepubOcatfon 

entffiedTteDfMsequerreofhu^ Dunham I. et aL, Mature (1999) 402:489^95. 

Indicates DMA strand from which exons were predicted. 
Indicates rructeotkte posffions of predicted axons. 



Pkey 


Ref 


Strand Mlposition 


400545 


9800107 


Minus 


124618-124881 


400870 


9838306 


Minus 


34081-35027 


400973 


7950452 


hfinus 


9311948253 


401093 


8516137 


NBnus 


22335-23166 


401590 


9966320 


Minus 


33547-33649 


401665 


7145001 


Plus 


121591-122537 


401810 


7342191 


Plus 


129063-129476 


401840 


7684597 


Pbs 


5628356439 


402054 


8083691 


Mfrus 


82884806 


402195 


7689778 


Mrnus 


147901-148884 


402583 


7684486 


Pbs 


94883-95003 


402690 


8348058 


Pbs 


13368-13998 


40388 


8570304 


Mnus 


108641-108903 


402779 


9588555 


Minus 


3817W3210 


403017 


6693623 


Pbs 


7863079367 


403051 


4827080 


Minus 


52695411 


403283 


7770677 


Pbs 


52431-52737 


403433 


9719611 


Mkius 


7222W2437 


403593 


6862650 


Minus 


62554-62712,6944939602 


403612 


8469060 


Minus 


94723-94859 


403921 


7711590 


ftfnus 


3297-3536 


404368 


7630956 


Minus 


102053-102199 


404682 


9797231 


Minus 


4097741150 


4046B9 


7534100 


Pbs 


119461-119717 


405016 


6524300 


Pbs 


5199753308 


405062 


7657730 


Pbs 


101283-101432 


406118 


9143818 


Pbs 


53997-64629 


406344 


9255974 


Pbs 


20254-20374.2052640659.2083&21097 


406563 


7711604 


Pbs 


34401*34538 
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TABLE 24: 

Table 24 depicts Seq ID No., UnigenelD, UnigeneTitle, Pkey, Pred.CeU.Loc, and ExAccn 
for all of the sequences in Table 25. The infonnation in Table 24 is linked by Seq ID No. to 
Table 25. 
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Pkey: Unique Eos probesettdentffier number 

ExAccn: Exenipbr Accession wirnber, Genbank accession number 

UnfeeneJD: Unfeene number 

UrageneTKe: Unfeenegeneffie 

Pred.CelLLo&: PnaaldedCe&JbrLocaBzatton 

Seq.tD.No.: Sequence ktenfficafion Number found in Table 25 

Pkey ExAccn UnigenelD UrugeneTltte 



PredCeOloc Seq. ID. No. 



449/46 
407276 
415539 
400297 
450375 
102457 
429170 
424399 
422505 
449765 



42G215 
439640 
410102 
429220 
416276 
409079 
442818 
442082 
444381 
446163 
416636 
442117 
433043 
429353 
452190 
446733 
452747 
423242 
417433 
432201 
423961 



114460 
404561 
325372 
112287 
335824 
424735 
400289 
427585 
429925 
429441 
421155 
420931 
420813 
45Z744 



A1668594 Hs.176588 
AI951118 HS32G736 
AI733881 Hs.72472 
AI127076 Hs334473 
AADQ9647 H&8850 
NNL001394H&2359 
NM.001394HS2359 
AJ905687 H&2533 
AL120862 Hs.124165 
N92293 Hs206832 
D90041 Hs.155956 
AW963419 Hs.155223 
AW449211 Hs.106445 
AW248508 Hs279727 
AW2072O6 Hs.136319 
U41060 Hs.79136 
WB7707 H&82065 
AKD01741 H&8739 
R41823 Hs.7413 
BE387335 H&283713 
AA026880 H&25252 
N32536 Hs.42645 
AW664964 Hs.128889 
W57554 Hs.125019 
AL117406 r^2Tj0102 
H26735 H&91668 
AA863360 Ha26040 
BE153855 H&61480 
AL0394Q2 Hs.125783 
BEZ70266 H&82128 
A1538613 H&298241 
D13666 Hs.136348 
AW6Q2166 HS222399 
BB066778 Hs.151678 

NA 

AB033064 H&334806 
NA 

U31875 K&272499 
X07820 rk225B 
031152 Hs.179729 
NNL000786HS226213 
AJ224172 H&204096 
H87879 Hs.102267 
AF044197 Hs.100431 
X51501 HsJSm 
AI267652 Hs30504 



ESTs, WeaWy san3ar to CP4Y^UMAN CYTOC 
Homo sapiens breast cancer anfigen NY-8R 
BKP-R1B 

hypomefica) proteh DKFZp56401278 

adisttegrinartdmetalop^ 

dual specffictty phosphatase 4 nuclear 

Ad specif phosphatase 4 nuclear 

akteh^oetydrog^^ cytoplasm 

ESTs 

ESTs,l^eiateJysMiartoALU8_HUMANA 

K^ceiyttransferase 1 (arytamme Nicety 

stannkxaJcin 2 

GDNF famBy receptor alpha 1 

Homo sapiens cDNAFU14035 (is, done HE 

ESTs 

U\M protein, estrogen regulated 
UerleuJdn 6 signal transducer (gp13u, 
hypotefical protein FU10879 
ESTs 

ESTs, WeaWysWar to S64054 hypotheti 
Homo sapfens cDNA FU136Q3 lis, done PL 
solute canter tamBy 16 (monocartxixyfc 
ESTs 

ryrr^ix)«mxiearpit^(l^4)mRNA 
ATP-bfcKfing cassette transporter MRP8 
HonTO$ap^ctoneF^498urtoc^rrtfW 
ESTs. WeaUy similar to tatty add omega 
Igsuperfaniify receptor LNIR 
DBU&6 proteh 

5T4 oncofetal trophobtest glyooproteh 
Trarismeiribiarie protease, serine 3 
osteoblast spedftc factor 2 (fasdcBn 
CEGP1 protein 

UDP4tacetyNpteM>^lacto 
WA_014112^Homo sapiens tnttorWraprab rrtahodrta 
Phase2&3Exons nuclear 
WAA1238 protein 

ENSP00W)0249()m)J222E13.1 (N-TERMWAL 
shortchah alcohol Q^rryttaa^nasefernfly 
matrbc nietaflopiotetoase 10 (stromaVste 
collagen, type X, alpha 1 (SchmxJ metaph 
cytocTHoneP450.51(lanosterd14^pte 
IpophSn B (utenxjWhfarnBy mentis 
lysyl oxidase 

smal inducible cytokine B subfamily (Cy 
protedta-hduced protein 



ER 



extracellular 



nuctear 



Homo sapiens mRNA; cDNA DKFZp434B082 (fr 
343 



SeqlD1&2 
SeqlD3&4 
SeqlD5&6 
SeqID7&8 
SeqlD9&10 
SeqtDl1&12 
Seq ID 11 & 12 
SeqlD13&14 
Seq ID 15 & 16 
SeqtD17&18 
Seq ID 19 & 20 
SeqlD21&22 
Seq ID 23424 
Seq ID 25 & 26 
Seq ID 27 & 28 
Seq ID 29 & 30 
Seq ID 31 & 32 
SeqlD33&34 
Seq ID 35436 
Seq ID 37438 
SeqlD39&40 
SeqlD41&42 
Seq ID 43 & 44 
Seq ID 45446 
Seq ID 47448 
Seq ID 49450 
Seq ID 51 452 
Seq ID 53454 
Seq ID 55156 
SeqlD57&58 
SeqtD59&60 
SeqiD61462 
Seq ID 63 & 64 
Seq ID 65466 
SeqlD67&68 
SeqlO69&70 
SeqB>71472 
Seq ID 73 & 74 
Seq ID 75476 
SeqID77&78 
SeqID79&80 
SeqlD81&82 
SeqID83&84 
Seq ID 65 & 86 
SeqtD87&88 
Seq©89490 
SeqlD91&92 
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420757 
424905 
423659 
446921 
445537 
426227 
424001 
421727 
452838 
419667 
414812 
426320 
422867 
411558 
417866 



10 



15 



431958 
428722 
412970 
20 421379 
415752 
444051 
451110 



X78592 Hs.99915 
NM.0O2497HS.1537O4 
NMJX>7050H$225952 
AB012113 Hs.16530 
AJ245671 Hs.12844 
AA321649 H&2248 
W67883 Ks.137476 
Y13153 Hs.107318 
U65011 H&30743 
AU077Q05 H&92208 
X72755 Hs.77367 
W47595 Hs.169300 
L32137 Ha1584 
AA102670 Hs.70725 
AWQ67903 H&82772 
A1249368 H&38558 
XS3629 H&2877 
U76456 Hs.190787 
AB026436 Hs.177534 
Y15221 Hs.103982 
BE314524 Hs.78776 
M48373 Hs.10247 
AB55040 H&265396 



androgen receptor {dihydrotestostercner 
NIMA (never In mftoste gene a>rebted k 
protein tyrosfoe phosphatase, receptort 
smaMlndm^cyfeoktosubtami)yA(Cy 
EGF-fike-domaln, multlpte 6 
srr^trxhjcfolecytoldresuta 
paternally expressed 10 
kyratferdrte34noooc^ 
preferentially expressed antigen tn mete 
a (flsntegrtn and metafloprotetnase doma 
monokine Induced by gamma Interferon 
transfonrtng growth factor, beta 2 
cartilage ofigotnerfc matrix protein (pse 
garnma^tmrnebutyric add (GABA) A recepto 
cofegen, type XI, alpha 1 
ESTs 

cadhsrin 3, type 1, P-catfterin (placenta 
tissue inhibitor of metaBoproteinase4 
duarsped fcfly phosphatas e 10 
smal inducftfe cytokine subferrflyB(Cy 
putative traiisrnentbfane protein 
acSvatedteuoocytsceiadheskvirnofecu 
ESTs, WeaWy sfenDar to transfonrafion-r 



cytoplasm 


r% m no o ft J 

Seq ID 93 & 94 


nuclear 


Seq ID 95496 




Seq ID 97498 


extracellular 


SeqlD99&100 




Seq ID 101 & 102 


extracellular 


SeqlD103&104 




Seq D 105 & 106 




Seq ro 107 & luo 


nuclear 


Seq ID 109 & 110 




Seq ID 111 & 112 


extracellular 


Seq ID 113 & 114 


extracefiuter 


Seq ID 115 & 116 


extracellular 


Seq ID 117 & 118 




Seq ID 119 & 120 




Seq ID 121 & 122 


I 


Seq ID 123 & 124 


plasma membn 


sneSeqlD125&126 




Seq ID 127 & 128 




Seq ID 129 & 130 


extracefcilar 


Seq ID 131 & 132 




Seq ID 133 & 134 




Seq ID 135 & 136 




Seq ID 137 & 138 
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TABLE 24 A 

Table 24A shows the accession numbers for those pkeys lacking unigenelD's for Table 24. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
5 were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

10 Ptey: UnlqueEos probeset Identifier number 

CAT number: Gene cluster number 
Accession: Genbank accession numbers 

15 Ptey CAT number Accession 

335824 CH22_3197FG_619J1JJNDeE 
325372 c12Jb 
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TABLE 24B 

Table 24B shows the genomic positioning for those pkeys lacking unigene ID's and 
5 accession numbers in Table 24. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Ptey: Unique number amesporefing to an Eos probeset 

10 Reft Sequencesoujca 7te7dWmimbe©hfobcQlu™ 

enW^ DM sequence of hur^ Dunham I. et aL, Mature (1999) 402:489-495. 

Strand: taficatesDNA strand from whto 

Ntposifon: Indicates nucteo&tepodfons of predicted axons. 



15 



20 



Pkey Ref Strand NtposiUon 
404561 9795980 Minus 69039-70100 
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Table 25 

WA^exonlS^ 
SeqlDNO: 1 DNAseguenct 

NudM Acid Accession #: FGENESH predicted ORF 

Coding sequence 1-1518 (nnderimcd se qu e n ces correspond to abut and slop eodom) 
I II 21 31 41 » 

ATTCAGOCCTCCnKJCTICA OOAACTCATO GCTCACCCCT TCTTOCTGCT GATOCnXTC 60 
TG^TOTCTCTCCTGCTGTT TCAGGTAATC AGGTTGTACC AGAGGAGGAG ATGGATCATC 120 
ISW^ ACCTGTmX; TGCACCCCCT «XX^CT«nTCTATX3GCCAC^AGGAGm 180 
TAttCACTAA AGGAGTTTGA GGTGTATCAT AAGCTG ATGG AAAAATACCC ATOTGCTGTT 240 

ccCTrorcGaTTCGACCcrr tacgatottc ttcagtgtcc atgacccaoa ctatgocaag 300 

ATTTTTOCTGA AAAQACAAQA TCCCAAAAGT GCTGTTAGCC ACAAAATCCT TGAATCCTGG 3fO 
G^TStoSg GACTTCTGAC CCTGGATCGT TCTAAATGGA AAAAGCACCO CCAGATTGTG 420 
SvACCTCGCT TCAACATCAG CATTCTGAAA ATATlX^T^CCATt^^T^AGTOIT 480 
CTOATOATCC TOAACAAATG OGAGGAACGC ATTGCCCAAA ACTCACGTCT GGAGCTCTTT 540 
C^CA^CT COCTCATGAC CCTGGACAGC ATCATCAAGT GTGCCTTCAG CCACCAGGGC 60O 
AGCATCCAGTTCGAC^GTAC CCTCGACTCA TACCTOAAAG CAGTTGT^ OT^AA 660 
ATfMCAACC AGCGCATGAA CAATTTTCTA CATCACAACG ACCTGGTnT CAAATTCAGC 720 
TCTC^WOre AAATCnTtCTAAATTTAAC CAACSAACITC ATCAGTTCAC AOAOAAACTA 780 
SS^G^ G^SoAGTC TCTTAAGGAT AAGCTAAAAC AAGATACTAC ^GAAAAGO 840 

CGCTGGG ATT TTCTGGACAT A ^J3JT^ < fZ^^S^^2^^^»^fAr?T^4 r-ATrA^ATTX: ^60 



AGTRCTAT^r^CCTGGATCCTTrACTGCTTG GCAAAGTACC CTOAGCATCA GCAGAGATGC 1020 
^S^S^ia^^ACI «<n™^ "AC^A A^C^AGC ,0*. 
OkO ATCCCTTACACCACGAT OTOCATCAAO OAATGOCTCC GOCTCTACGC ACCGGTAOTA 1140 
AACATAToS GGTTVVCTCGA CAAACCCATC ACCTTTCCAG ATGGACGCTC CTTACCTGCA 1200 
G^tSSS TATTTGGGCT ™^ A^AmC^GGAAOAC 2£ 

mr- Anmtrr ttaacccctt gagattctcc agggaaaatt ctgaaaaaat acatccctat 1320 

G^TCATAC CATTCTCAGC TGGATT AAGG AACTOCATTO QGCAGCATTT TGCCATAATT 1380 
oS^SS ATTAACTCTG CTCCGCTTCA A«nX3GCTCC AGACCACTCA 1440 
AGGCCTCCCC AGCCTGTTCG TCAAGTTGTC CTCAAOTCCA AGAATOGAAT ^TOTOTTT 500 

a iiTATiTA a rrr Ann ATAC TTCTGACTGG TTTTGACATC CATTAACAGT AA 



AATTATAJAA CTTAGGATAC TTCTGACTGG TTTTGACATC CATTAACAGT AATTTTAATT 1680 



TCTTTGCTGT ATCTGGTGAA ACtXt&MM awu^iww* '^^^"^ -"~;i™^T .™ 
TOCGAAGGGA AATTATTGGT TTGTGTAACT AGTGGTAG AG TGGCTTTCAA GCATAGTTTG 1800 

TTATTCTCAG TTATCTTTCC CCAATAATAA AAAA 

SeqlDNO: 2 frotcro sequence: . 
Protra Accession*: FGENESH predicted g 

| 11 21 3! 41 51 

MHPSWLOELf AHPFLLULL CMSLIXFQVI RLYQRRRWMI RALHLFPAPP AHWFYGHKEF €0 
AH^^S^WH KIJNffilCyPCAV PLWVGPFIMF FSVHDPDY AK IIXKRQDPKS AVSHKHESW 120 
VGRGLVTLDG SKWKKHRQIV KPGFNISHJC fflTMMSESV RMMLNKWBER IAQNSRLBLF 180 
(fflVSLWnJJSIMKCAPSHQ^ *0 
SOG OIFSKFN QELHQFTEKV IQDRKESLKD KXKQDTTQKR RWDFLDHXS AKSENTKDFS 300 
^LOAEVKTFMFAGHDTTS SA1SWILYCL AKYPEHQQRC RDEIRELLGD GSS1TWEHLS 360 
OMPYTIMCIK ECLRLYAFW N1SRLLDKP1 TFPDGRSLPA G1TVFINIWA LHHNPYFWED 420 
PQVFNPLRFS RENSEKIHFY AFIPFSAGLR NCIGQHFAII ECKVAVALTL LRFKLAPDHS 480 
RPPQPVRQW LKSKNGIHVF AKKVC 

Seq ID NO: 3 DNA sequence 

Nuclek Add Accession NM_052997 

Coding sequence: 1004125 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

CTAGTCTATA CCAGCAACGA CTCCTACATC GTCCACTCTG GGGATCTTAG AAAGATCCAT 60 
AAAGCTGCCT CCCGGGGACA AGTCCGGAAG CTGGAGAAGATjGACAAAGAG GAAGAAG ACC 120 
ATCAACCTTA ATATACAAG A CGCCCAGAAG AGG ACTGCTC TACACTGGGC CTGTGTCAAT 180 
GGCCATGAGG AAQTAGTAAC ATTTCTGGTA G ACAGAAAGT GCCAGCTTGA CGTCCTTGAT 240 
GGCGAACACA GGACACCTCT GATGAAGGCT CTACAATGCC ATCAGGAGGC TTGTGCAAAT 300 
ATTCTGATAG ATTCTGGTGC CG ATATAAAT CTCGTAGATG TGTATGGCAA CATGGCTCTC 360 
CATTATGCTG TTTATAGTGA GATTTTGTCA GTGGTGGCAA AACTGCTGTC CCATGGTGCA 420 
GTCATCGAAG TGCACAACAA GGCTAGCCTC ACACCACTTT TACTATCCAT AAOG AAAAGA 480 
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CTTtnTCAGC AAAATGTTCA CGTCTTTOCT GCAGATATAT GTGGAGTAAC TO^GAACAT 560 
tTt/J£™ rTn^A^TCATCACATT CATC AACAAA TTATGGAATA TATACGAAAA 720 

A^^Ar^^S^ AACAGTCA GCAGAAGAAA CACCTAGGGA AATTACGAGT 1020 

A^^^ST* £S2£gGATC OTG^CAGA GTAACATCTA ATAAAACTAA AGTTTTGGAA 1260 
I^Sa^SaaOA^T^^ 1320 

TCTOTOA^TC '^n^SEaW '^^^CA^^GATTCAAQTGT GTATACCTGA ^^^E^J^LJ^^ 

TOGAGATAAA TAGAGAAGTA GAAGAGCCTC CTA^^ ATCTG^ 
A^rZZnrriTmt. A ATTICA AAACICTOTT CCAAATAAAO CCTTTQAATT GAAGAATGAA 1560 
^a^Sa" ^A^^ GATOm^ C^^-KX^ AACAAAAGGA CTATGAAGAA 1620 

?^^^^^^e^^AATAA\(XX^TGGKA-nGAA AAATOAACAA 1920 
TCTTGGGATA CTGAGAGTCT CTGTOAGACT GTTTCACAQA AGGATGTCTGTTTACOCAAG 2040 

SSS^A^A^S^^ 

TCGGATTCTG AGAGTCICTG TGAGACTGTT ^^M^AT^T^^^^^ 

A^^U^^C AGA1GTTCCC TTCAGAA.TCA **ACAAAAGA^^CTTOAMA AA^T^TroO^OTO 
GATTTTTCAGA GTCTCOGTGA GACTGTTTCA CAGAAGGATG TGTGTCTACC CA^^™^ 
^^^AAA^^TOGATAA AATAAGTGGA AAATTAGAAG ATTCAACTAG CX7TATCAAAA 2K0 
J^TO^ACAan^nCTnrTGAAAGA GCAAGGGAAC TTCAAAAAGA TCACTGTGAA 2880 
^S^Ia^AyuSSiTCGA ACAAATOAAA AAGAAGTmGTCTACIOAAAAAOAAACTO 2*0 
TT^fiAAGCAA AAGAAATAAA ATCACAGTTA GAG AACCAAA AAGTT AAATG GGAACAAG AG 3000 
CTCTOCAff^TOAGATTCACT^AAACCAA GAAG AAGAGA AGAGAAGAAA TGC^TATA 3060 
SSSt^/SSStagcga AGAATTAGGA AGAATCGAAG AGCAGCATAG GAA^AGTTA 31M 
AS AACAACTTGA ACAGGCTCTC AGAATACAAG ATATAGAATT QAAGAGTGTA 3180 

rl^A^^^TGl^^^rrCTCACACT CATGAAAATG AAAATTATCT CTTACATGAA 3240 

2^^ga^aSa£^ 

CAATACCAGG AAAAGGAAAA TAAATACTTT GAGGACATTA AGATTTTAAA AOAAAAOAAT 3360 

SJoaI^aSa^cotaaaactoaaag^^ 

50 TATAOTTXWCAGCTTAAAOTTCIGATAGCT^ 

GAAAAACAAG ACAAAGAAAT ACTAGAGGCA <^JJ2^\^^^J^^^^^36o6 
TCTOCTOTAC AAGACCATQA TCAAATTOTG ACATCAAOAA AAAO^GA AOTOCTTTC 3600 
CACATTGCAG GAGATGCTTG TTTGCAAAGA AAAATGAATO TTQATGTGAO TAGTACGATA 3660 
5aTAACAATO AGOTGCTCCA TCAACCACTT TCTGAAGCrC AAAGQAAATC CAAAAGCCTA 3720 

55 AAAATTAATC TC^AITATGC AGGAGATGCT ^AA0AGAAAATACATTGGTTTCAaAACA^37M^ 
GCACAAAGAO ACCAACGTGA AACACAGTGTCAAATGAAGG AAGCIOAACACATCTATCAA 3840 

aTSaacaag aSatSgaa caaacacactgaacaoc^ aotc^™ 

Tm^ACAAAGCAAAAATAT(nt3GCTTCAACAG^ »• 
OCTGACAACA AAAGCAAGAT AACAATTGAT ATTCATTOTI^AGaAA AATGCAACAT «B0 

£££^TO3AG^CA ACAGACCAG A TCTTTACICA CAACTCATGC TAGGAGGCCA «M0 
GTCCTAGCATCACCTTATXjT TOAAAATCTT ACCAATAGTC TOTGTCAA^ G A ATAmAT 4M0 
T^GAAOAA AAATTCATGA TTTCTTCCrG AAGCCTACAG ACAT AAAATA ACAGTGTCAA 4320 
65 GAATTACTTOTTCACGAATT GCATAAAGCTGCACAGGATTCCCATCTACC CTGATGATGC 4380 
AGCAGACATC ATTCAATCCA ACCAG AATCT CGCTCTGCAC TCCAGCCTAG GTOACAGAGT 4440 
GAGACTCCAC CTCGGAAA 

70 SeqIDNO:4ErDtein 



30 

35 
40 



Protein Accession ft NP.443723.1 
1 II 21 31 41 51 

75 MTKRKKTINt. WQDAQKRTa'lHWACVNGHB BWTFLVDRK CQLDVLDGEH RTP1MKALQC 60 
iSd^vd VYGNMALHYA vysbilswa KLLSHGAVIE VHNKASLTPL 120 
ISlTKBMQ NTiFLUKNA NANAVNKYKC TALMLAVCHO SS0VGMLLQ QNVDVFAAD1 ISO 
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CGVTAEHYA V TCGFHHIHEQ IMBYIRKLSK NHQNTNPEGT SAGTPDEAAP LAERTPDTAE 240 
SLVEKTPDEA APLVERTPDT AESLVEKTPD EAASLVEOTS DKIQCLEKAT SGKFEQSAEE 300 
TPRETTSPAK ETSEKFTWPA KGRPRKIAWB KKEDTPREIM SPAKETSEKF TWAAKGRPRK 360 
IAWEKKETPV KTGCV ARVTS NKTKVLEKGR SCM1ACPTKB SSTKASANDQ RFPSESKQEB 420 
5 DEEYSCDSRS LFESSAK1QV OPBSIYQKV MEINREVHEP PKKPSAFKPA JEMQNSVPNK 4S0 

AFELKNEQTL RADPMFPPES KQKDYEENSW DSESLCETVS QKDVCLPKAT HQKHDKING 540 
KLEESPNKDG LLKATCGMKV SUTXALELIC DMQTFKAEPP GKPSAFEPAT EMQKSVFNKA 600 
LELKNBQTWR ADEELPSESK QKD YBENSWD TESLCETVSQ KD VCLPXAAH QKEIDKINGK 660 
LEGSPVKDGL LKANCGMKVS IFTKALELMD MQTFKAEPPE KPSAFEPAIB MQKSVPNXAL 720 

10 ELKNEOTLRA DEILPSESKQ KDYEESSWDS ESLCETVSQK DVCLPKATHQ KEIDKINGKL 780 
EESPDNDGFL KAPCRMKVSI PTKALELMDM QTFKAEPPEK PSAFEPAIEM QKSVPNKALE 840 
LKNBQTLRAD QMFPSBSKQK KVEENSWDSE SLRBTVSQKD VCVPKATHQK EMDKISGKLE 900 
DSTSLSKILO TVHSCERARB LQKDHCEQRT GKMEQMKKKF CVLKKKLSEA KHKSQLENQ 960 
KVKWEQHjCS VRLTLNQEEE KRRNADILNE KIREELGRIB EQHRKELEVK QQLEQALRIQ 1020 

15 DIELICSV^SN LNQVSHTHEN ENYLLHENCM LKKEIAMLKL ELATLKBQYQ HKENKYFEDI 1080 
KILKEKNAEL QMTLKLXBES LTKRASQYSG QUCVLZAENTMLTSKLKEKQ DKEZLBAEZB 1140 
SHHPR1ASAVODHDQIVTSR KSQEPAFHIA GDACLQRKMN VDVSSTIYNN EVLHQPLSEA 1200 
QRKSKSUON LNYAGDALRB NTLVSEHAQR DQRETQCQMK EAEHMYQNEQ DNVNKHTEQQ 1260 
ESLDQKLFQL QSKNMWLQQQ LVHAHKKADN KSKmWHF LERKMQHHLL KEKNEEIFNY 1320 

20 NNHLKNRIYQ YEKEKAETEN S 



25 



Seq ID NO: 5 DNA s 

Nucleic Acid Accession #: Dtmefbtmd 
Coding sequence: 273-1785 (underlined sequences contspond to start and stop codons) 



1 U 21 31 41 51 

CGCGGGGCGC GGAGTCGGCG GGGCCTCGCG GGACGCGGGC AGTGCGGAGA CCGCGGCGCT 60 

30 G AGGACGCGG GAGCCGGGAG CGCACGCGCX3 GGGTGGAGTT CAGCCTACTC TTTCTTAGAT 120 
GTGAAAGGAA AGGAAGATCA TTTCATGCCT TGTTGATAAA GGTTCAGACT TCTGCTGATT 180 
CATAACCATT TGGCTCTGAG CTATGACAAG AGAGGAAACA AAAAGTTAAA CTTACAAGCC 240 
TGCCATAAGT GAGAAGCAAA CTTCCTTGAT AACATGCTTT TGCGAAOTGC AGGAAAATTA 300 
AATGTGGGCA CCAAGAAAGA GGATGGTGAG AGTACAGCCC CCACCCCCCG TCCAAAGGTC 360 

35 TTGCGTTGTA AATGCCACCA CCATTGTCCA GAAGACTCAG TCAACAATAT TTGCA GCACA 420 
GACGGATATT GTTTCACGAT GATAGAAGAG GATOACTCTG GGTTCCCIOT GGTCAC7TCT 480 
GGTTGCCT AO G ACTAGAAGG CTCAGATTTT CAOTGTCGGG ACACTCCCAT TCCTCATCAA 540 
AGAAGATCAA TTGAATGCTG CACAGAAAGG AACGAATGTA ATAAAGACCT ACACCCTACA 600 
CTGCCTCCAT TGAAAAACAG AG ATlflO ll GATGGACCTA TACACCACAG G GCTTTA CTT 660 

40 ATATCTGTOA CTOTCTGTAG TTTGCTCTTG GTCCTTATCA TATTATTTTG TTACTTCCGG 720 

TATAAAAGAC AAGAAACCAG ACCTCGATAC AGCATTGGGT TAGAACAGGA TQAAACTTAC 780 
ATTCCTCCTG GAGAATCCCT GAGAGACTTA ATTGAGCAGT CTCAGAGCTC AGGAAGTGOA 840 
TCAGGCCTCC CTCTGCTGGT CCAAAGGACT ATAGCTAAGC AGATTCAGAT GGTGAAACAG 900 
ATTGGAAAAG GTCGCTATGG GGAAGTTTGG ATGGGAAAGT GGCGTGGCGA AAAGGTAGCT 960 

45 GTGAAAGTGT TCTTCACCAC AGAGGAAGCC AGCTGGTTCA G AGAGACAGA AATAT ATCAG 1020 
ACAGTGTTGA TGAGGCATGA AAACATTTTG GGTTTCATTG CTOCAQATAT CAAAGGGACA 1080 
GGGTCCTGGA CCCAGTTGTA CCTAATCACA G ACTATCATG AAAATGGTTC CCTTTA7/GAT 1140 
TATCTGAAGT (XACCACXXT AGACGCTAAA TCAATGCTGA AGTTAGCCTA CTCTTCTGTC 1200 
AGTGGCTTAT GTCATTTACA CACAGAAATC TTTAGTACTC AAGGCAAACC AQCAATTQCC 1260 

50 CATCGAGATC TGAAAAGTAA AAACATTCTG GTQAAGAAAA ATGG AACTTG CTGTATTGCT 1320 
GACCTGGGCC TGGCTGTTAA ATTTATTAGT GATACAAATG AAGTTGACAT ACCACCTAAC 1380 
ACTCGAGTTG GCACCAAACG CTATATGCCT CCAGAAGTGT TGOACGAOAO CTTOA ACAOA 1440 
AATCACTTCC AGTCTTACAT CATGGCTGAC ATGTATAGTT TTGGCC TCAT CCTITGGQAG 1500 
QTTGCTAGOA OATGTOTATC AGOAGGTATA QTQGAAG AAT ACCAGCTTCC TTATCATGAC 1560 

55 CTAGTGCCCA GTGACCCCTC TTATGAGGAC ATGAGGG AGA TTGTOTGCAT CAAGAAGTTA 1620 
CGCCCCTCAT TCCCAAACCG GTGGAGCAGT GATOAGTGTC TAAGGCAGAT GGGAAAACTC 1680 
ATGACAGAAT GCTGGGCTCA CAATCCTGCA TCAAGGCTGA CAGCCCTGCG GGTTAAGAAA 1740 
ACACTTGCCA AAATGTCAGA GTCCCAGGAC ATTAAACTCT GATAGGAGAG GAAAAGTAAG 1800 
CATCTCTGCA GAAAGCCAAC AGGTACTCTT CTGTrTGTGG GCAGAGCAAA AGACATCAAA 1860 

60 TAAGCATOCA CAGTACAAGC CTTGAACATC GTOCTGCTTC CCAGTGGGTT CAG ACCTCAC 1920 
CTTTCAGGGA GCGACCTGGG CAAAGACAGA GAAGCTCCCA GAAGGAG AGA TTGATCCGTG 1980 
TCTGTTTGTA GGCGG AG AAA CCGTTGGGT A ACTTGTTCAA GATATGATGC AT 

65 Seq ID Ntt 6 Pjyffn sequence: 

none found 



I 11 21 31 41 51 

^ ■ MIXBSAGION VGTKKEDGES TAPTPRPKVL RCKCHHHCPE DSVNNICSTD GYCFTMIEED 60 
PSGLPWTSG CLGLEGSDFQ CRJDTPIPHQR RSIECCTERN ECNKDLHPTL PPLJKNRDFVD 120 
GPfHHRALU SVTVCSLLLV LHLFCYFRY KRQETRPRYS IGLEQDETYI PPGESLR DU 18 0 
EQSQSSGSGS GLPLLVQRTI AKQIOMVKQI GKGRYGEVWM GKWRGEKVAV KVFFTTEEAS 240 

75 WFRETEIYOT VLMRHENILG FIAADKGTG SWTQLYUTD YHENGSLYDY LKSTTLDAKS 300 
MLKLAYSSVS GLCHLHTEIF STQGKPAIAH RDLKSKNILV KKNGTCOAD LGLA VKFISD 360 
XNEVDIPPNT RVGTKRYMPP EVLDESLNRN HFQSY1MADM YSFGLILWBV ARRCVSGGIV 420 
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EEYQLPYHDL VPSDPSYEDM REIVOKKLR PSFPNRWSSD ECLRQMGKLM TECWAHNPAS 480 
RLTALRVKKT LAKMSESQD1 KL 

Seq ID NO: 7 DNA sequence 

Nucleic Acid Accession #: . none found 

Coding sequence: 482-3007(undeiiined sequences correspond to start and stop codoos) 



1 U 21 31 41 51 

10 | ] i i i ) _ 

AACTGAGCTA ACAAGAAATA CTAGAAAAGG AGGAAGGAGA ACATTGCTGC AGC TTGOATC 60 
TACAACCTAA GAAAGCAAGA GTGATCAATC TCAGCTCTGT TAAACATCTT GTTTACTTAC 120 
TGCATTCAGC AGCTTGCAAA TGGTT AACTA TATGCAAAAA AGTCAGCATA GCTGTGAAGT 180 
ATGCCGTGAA TTTTAATTQA GGGAAAAAGG GACAATTGCT TCAQGATGCT CTA OTAT OCA 240 

15 CTCTCCTTGA AATATTTTCA ATGAAATGCT CAGTATTCTA TCTTTGACCA GAGGTnTAA 300 

CTTTATGAAG CTATGGGACT TOACAAAAAG TGATATTTGA GAAGAAAGTA CGCAGTGGTT 360 
GGTGTTTTCT TTTTTTTAAT AAAGGAATTO AATTACTTTG AACACCTCTT CCAGCTGTGC 420 
ATTACAG ATA ACGTCAGGAA GAGTCTCTGC TTTACAGAAT CGGATTTCAT CACATGACAA 480 
CATGAAGCTG TGGATTCATC TCTTTTATTC ATCTCTCCTT GCCTGTATAT CTTTACACTC 540 

20 CCAAACTCCA GTGCTCTCAT CCAGAGGCTC TTGTGATTCT CTTTGCAATT GTGAGGAAAA 600 
AGATGGCACA ATGCTAATAA ATTGTG AAGC AAAAGGTATC AAGATGGTAT CTGAAATAAG 660 
TGTGCCACCA TCAOSACCTT TCCAACTAAG CTTATTAAAT AACGGCTTGA CG ATGCTTCA 720 
CACAAATG AC TTTTCTGGGC TTACCAATGC TATTTCAATA CACCTTGGAT TTAACAATAT 780 
TOCAGATATT GAG ATAGGTG CATTTAATGG CCTTGGCCTC CTGAAACAAC TTCATATCAA 840 

25 TCACAATTCT TTAOAAATTC TTAAAGAGQA TACTTTOCAT QGAC TOGAAA ACCTGGAATT 900 
CCTGCAAGCA GATAACAATT TTATCACAGT GATTQAACCA AGTGCCTTTA GCAAGCTCAA 960 
CAGACTCAAA GTGTTAATTT TAAATGACAA TGCTATTGAG AGTCTTCCTC CAAACATCTT 1020 
CCGATTTGTT CCTTTAACCC ATCTAOATCT TCQTGGAAAT CAATTACAAA CATTGCCTTA 1080 
TGTTGGTTTT CTCGAACACA TTGGCCG AAT ATTGGATCTT CAGTTGGAGG ACAACAAATG 1140 

30 GGCCTGCAAT TGTGACTTAT TGCAGTTAAA AACTTGGTTG GAGAACATGC CTCCACAGTC 1200 
TATAATTGGT GATGTTGTCT GCAACAGCCCTCCATTTTTT AAAGGAAGTA TACTCAGTAG 1260 
ACTAAAGAAG GAATCTATTT GCCCTACTCC ACCAGTGTAT G AAGAACATG AGG ATCCTTC 1320 
AGGATCATTA CATCTGGCAG CAACATCTTC AATAAATGAT AGTCGCATGT CAACTAAGAC 1380 
CACGTCCATT CTAAAACTAC CCACCAAAGC ACCAGGTTTG ATACCTTATA TTACAAAGCC 1440 

35 ATCCACTCAA CTTCCAGGAC CTTACTGCCC TATTCCTTGT AACTGCAAAG TCCTATCCCC 1500 
ATCAGGACTT CTAATACATT GTCAGGAGOG CAACATTGAA AGCTTATCAO ATCT GAGACC 1560 
TCCTCCGCAA AATCCTAGAA AGCTCATTCT AGCGGGAAAT ATTATTCACA GTTTAATGAA 1620 
GTCTGATCTA GTGG AATATT TCACTTTGGA AATGCTTCAC TTGGGAAACA ATCGTATTGA 1680 
AGTTCTTGAA GAAGGATCGT TTATGAACCT AACG AGATT A CAAAAACTCT ATCTAAATGG 1740 

40 TAACCACCTG ACCAAATTAA GTAAAGGCAT GTTCCTTGGT CTCCATAATC TTGAATACTT 1800 
ATATCTTG AA TACAATGCCA TTAAGGAAA7 ACTGCCAGGA ACCTTTAATC CA ATGCCT AA 1860 
ACTTAAAGTC CTOTATTTAA ATAACAAOCT CCTCCAAGTT TTAOCACCAC ATATTTTTTC 1920 
AGGGGTTCCT CTAACTAAGG TAAATCTTAA AACAAACCAG TTTACCCATC TACCTGTAAG 1980 
TAATATTTTG GATGATCTTG ATTTACTAAC CCAGATTGAC CTTGAGGATA ACCCCTGGGA 2040 

45 CTGCTCCTQT GACCTGGTTG GACTGCAGCA ATGGATACAA AAOTTAAGCA AGAACACAGT 2100 
GACAGATGAC ATCCTCTGCA CTTCCCOCGG GCATCTCGAC AAAAAGGAAT TGAAAGCCCT 2160 
AAATAGTGAA ATTCTCTGTC CAGGTTTAGT AAATAACCCA TCCATGCCAA CACAGACTAG 2220 
TTACCTTATG GTCACCACTC CTGCAACAAC AACAAATACG GCTO ATACT A TTTTACOATC 2280 
TCTTACGGAC GCTGTGCCAC TGTCTGTTCT AATATTGGG A CTTCTGATTA TGTTCATCAC 2340 

50 TAI im 1 1 I V 10 1 GC TGCAG GGATAGTGGT t Vl i til KT I CACCGCAGGA GAAGATACAA 2400 
AAAGAAACAA GTAGATGAGC AAATGAGAGA CAACAGTCCT GTCCATCTTC AGTACAGCAT 2460 
GTATGOCCAT AAAACCACTC ATCACACTAC TO AAAGACCC TCTGCCTCAC TCTATG AACA 2520 
GCACATGGTG AGCCCCATGG TTCATGTCTA TAGAAGTCCA TCCTTTGGTC CAAAGCATCT 2580 
GGAAGAGGAA GAAGAGAGG A ATGAGAAAG A AGGAAGTGAT GCAAAACATC TCCAAAG AAG 2640 

55 TCTTTTGOAA CAGGAAAATC ATTCACCACT CACAGGOTCA AATATGAAAT ACAAAACCAC 2700 
GAACCAATCA ACAGAATTTT TATCCTTCCA AG ATGCCAGC TCATTGTACA GAAACATTTT 2760 
AGAAAAAGAA AGGGAACTTC AGCAACTGGG AATCACAGAA TACCTAAGGA AAAACATTGC 2820 
TCAGCTCCAG CCTGATATGG AGGCACATTA TCCTGGAGCC CACGAAGAGC TGAAGTTAAT 2880 
GG AAACATTA ATOTACTCAC GTCCAAGGAA GGTATTAGTG GAACAOACAA AAAATGAGTA 2940 

60 TTTTGAACTT AAAGCTAATT TACATGCTGA ACCTGACTAT TTAGAAGTCC TGGAGCAGCA 3000 
AACATAGATG GAGAGTTTGA GGGCTTTCGC AGAAATGCTG TGATTCTGTT TTAAGTCCAT 3060 
ACCTTGTAAA TAAGTGCCTT ACGTGAGTGT GTCATCAATC AGAACCTAAG CACAGCAGTA 3120 
AACTATGGGG AAAAAAAAAG AAGAAGAAAA GAAACTCAGG GATCACTGGG AGAAGOCATO 3180 
GCATTATCTT CAGGCAATTT AGTCTGTCCC AAATAAAATC AATCCTTGCA TGTAAATC 

65 

Scott) NO. 8 Protein sequence 
Protein Accession #: none found 

70 1 1! 21 3t 41 51 

InCLWIHLFYS SLLACISLHS QTPVLSSRGS CDSLCNCEEK DGTMUNCEA KGIKMVSEIS 60 
VPPSRPFQLS LLNNGLTMLB TNDFSGLTNA ISIHLGFNNI ADIHGAFNG LGLLKQLHIN 120 
pNSLEH *ED TFHO',™^ tjQADNNFTTV 1EPSAFSKLN R1JCVULNDW AIESLPPNIF 180 
75 RFVPLTHLDL RGNQLQTLPY VGFLEHK3RI LDLQLE DNKW ACNCDLLQUC TWLENMPPQS 240 
HGDWCNSP PFFKGSILSR LKKES1CPTP PVYEEHEDPS GSLHLAATSS INDSRMSTKT 300 
TSHXLPTKA PGUPYITKP STQLPGPYCP IPCNCKVLSP SGLUHCQER NIBSLSDLRP 360 
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PPQNPRKLIL AGNIIHSLMK SDLVEYFTLB MLHLGNNRIE VLEEGSFMNL TRLQKLYLNG 420 
NHLTKLSKOM FLGLHNLEYL YLEYNAKEI LPGTFNPMPK LKVLYLNNNL LQVLFPHIFS 480 
GVPLTKVNLK TNQFTHLPVS NILDDLDLLT QIDLEDNFWD C9CDLVQLQQ WIQKLSKNTV 540 
TDDflJCTSPG HLDKKEUCAL NSEILCPGLV NNPSMFTQTS YLMVTTPATT TKTADTILRS 600 
5 LTDAVPLSVL OjOLLIMFIT 1WCAAGIW LVLHRRRRYK KKQVDEQMRD NSPVHLQYSM 660 
YGHKTTHHTT ERPSASLYBQ HMVSPMVHVY RSPSFGPK HL EEB EERNBKB GSDAKHLQRS 720 
IXEQENHSPL TGSNMKYKTT NQSTEFLSFQ DASSLYRNIL EKERELQQLG ITEYLRKNIA 780 
QLQPDMEAHY PGAHEELKLM ETLMYSRPRK VLVEQTKNEY FELKANLHAE PDYLEVLBQQ 840 



Seq ID NO: 9 DNA sequence 

Nucleic Acid Accession 8: NM JD03474 

Coding sequence: 507-3036 (tmdsxtined sequences correspond to start and stop codons) 

15 

1 II 21 31 41 51 

CACTAACGCT CTTCCTAGTC CCCGGGCCAA CTCGGACAGT TTGCTCATTT ATTGCAACGG 60 
TCAAGGCTGG CTTGTGCCAG AACGGCGCGC GCGCOACGCA CGCACACACA CGGGGGGAAA 120 

20 CTTTTTTAAA AATGAAAGGC TAGAAGAGCT CAGCGGCGGC GCGGGCCGTG CGCGAGGGCT 180 
CCGGAGCTGA CTCGCCGAGG CAGGAAATCC CTCCGGTCGC GACGCCCGGC CCCGCTCGGC 240 
GCCCGCGTGG GATGGTGCAG CGCTCGCCGC CGGGCCCGAG AGCTGCTGCA CIT3AAGGCCG 300 
GCGACGATGG CAGCGCGCCC GCTGCCCGTG TCCCCCGCCC GCGCCCTCCT GCTCGCCCTG 360 
GOCGGTGCTC TGCTCGCGCC CTGCGAGGCC CGAGGGGTGA GCTTATGGAA CGAAGGAAGA 420 

25 GCTGATGAAG TTGTCAGTGC CTCTGTTCGG AGTGGGGACC TCTGGATCCC AGTGAAGAGC 480 
TTCOACTCCA AGAATCATOC AGAAGTGCTO AATATTCG AC TAC AAOGG O A AAGCAAAGAA 540 
CTGATCATAA ATCTGGAAAG AAATGAAGGT CTCATTOCCA GCAGTTTCAC GGAAACCCAC 600 
TATCTGCAAG ACGGTACTGA TGTCTCCCTC GCTCGAAATT ACACGGTAAT TCTGGGTCAC 660 
TGTTACT ACC ATGGACATGT ACGGGG ATAT TCTGATTCAG CAGTCAGTCT CAGCACGTGT 720 

30 TCTGGTCTCA GGGGACTTAT TGTGTTTGAA AATGAAAGCT ATGTCTTAGA ACCAATGAAA 780 
AGTGCAACCA ACAGATACAA ACTCITOCCA OCGAAGAAGC TGAAAAGCGT CCGGG G ATC A 840 
TGTGGATCAC ATCACAACAC ACCAAACCTC GCTGCAAAGA ATGTGTTTCC ACCACCCTCT 900 
CAGACATGGG CAAGAAGGCA TAAAAGAGAG ACCCTCAAGG CAACTAAGTA TGTGGAGCTG 960 
GTGATCGTGG CAGACAACCG AGAQTTTCAO AGGCAAGGAA AAGATCTGGA AAAAGTTAAG 1020 

35 CAGCGATTAA TAGAGATTGC TAATCACGTT GACAAGTTTT ACAGACCACT GAACATTCGG 1080 
ATCGTGTTGG TAGGCGTGGA AGTGTGGAAT GACATOGACA AATCCTCTGT AAGTCAGGAC 1140 
CCATTCACCA GCCTCCATGA ATTTCTGGAC TGGAGGAAGA TO AAGCTTCT ACCTCGCAAA 1200 
TCCCATGACA ATGCGCAGCT TGTCAGTGGG GTTTATTTCC AAGGGACCAC CATCGGCATG 1260 
GCCCCAATCA TGAGCATGTG CACGGCAGAC CAGTCTGGGG GAATTGTCAT GGACCATTCA 1320 

40 GACAATOCCC TTGGTGCAGC OGTQACOCTG GCACATGAGC TGOGOCACAA TTTCGGGATO 1380 
AATCATGACA CACTGGACAG GGGCTGTAGC TGTCAAATGG CGGTTGAGAA AGGAGGCTGC 1440 
ATCATGAACG CITCCACCGG GTACCCATTT CCCATGGTGT TCAGCAGTTG CAGCAGGAAG 1500 
GACTTCG AG A CCAGCCTGG A G AAAGG AATG GGGGTGTGCC TGTTTAACCT GCCGGAAGTC 1560 
AGGGAOTCTT TCGGOGGCCA GAAGTGTGGG AACAGATTTG TGGAAGAAGO AGAGGAGTCT 1 620 

45 GACTGTGGGG AGCCAGAGGA ATGTATGAAT CGCTGCTGCA ATGCCACCAC CTGTACCCTG 1680 
AAGCCGGACG CTGTGTGCGC ACATGGGCTO TGCTQTGAAG ACTGCCACCT GAAGCCTGCA 1740 
GGAACAGCGT GCAGGGACTC CAGCAACTCC TGTGACCTCC CAGAGTTCTO CACAGGGGCC 1800 
AGCCCTCACT GCCCAGOCAA COTGTAOCTG CACOATOOGC ACTCATGTCA GGATOTGGAC 1860 
GOCTACTOCT ACAATGGCAT CTGCCAGACT CACGAGCAGC AGTGTOTCAC ACTCTGCGGA 1920 

50 CCAGGTGCTA AACCTGCCCC TGGGATCTGC TTTGAGAGAG TCAATTCTGC AGGTGATCCT 1980 
TATGGCAACT GTOGCAAAGT CTCGAAGAGT TCCTTTGCCA AATGCGAGAT G AGAG ATGCT 2040 
AAATGTGGAA AAATCCAQTO TCAAOOAGGT GCCAGOCGGC CA GTCAT TGG TACCAATGCC 2100 
GTTTCCATAG AAACAAACAT CCCCCTGCAG CAAGGAGGCC GGATTCTGTG CCGCGGGACC 2160 
CACGTGTACT TGGGCGATGA CATGOCGGAC CCAGGGCTTG TGCTTGCAGG CACAAAGTGT 2220 

55 GCAGATGGAA AAATCTGCCT GAATCGTCAA TGTCAAAATA TTAGTGTCTT TGGGGTTCAC 2280 
GAGTGTGCAA TGCAGTGCCA CGGCAGAGGG GTGTGCAACA ACAGGAAGAA CTGCCACTGC 2340 
GAGGCCCACT GGGCACCTCC CTTCTGTGAC AAGTTTGGCT TTGO AOOAA G CACAGACAGC 2400 
GGCCCCATCC GOCAAGCAQA TAACCAAGOT TTAACCATAO GAATTCTGGT GACCATCCTG 2460 
TGTCTTCTTG CTGCCGGATT TGTGGTTTAT CTCAAAAGGA AGACXHTGAT ACGACTGCTG 2520 

60 TTTACAAATA AGAAGACCAC CATTGAAAAA CTAAGGTGTG TGCGCCCTTC CCGGCCACCC 2580 
CXjTGGCTTCC AACCCTOTCA GGCTCACCTC GGCCACCTTG G AAAAGGCCT GATGAGGAAG 2640 
CCGCCAGATT CCTACCCACC GAAGGACAAT COCAGGAGAT TGCTGCAGTG TCAGAATOTT 2700 
GACATCAGCA GACCCCTCAA CGGCCTGAAT GTCCCTCAGC CCCAGTCAAC TCAGCGAGTG 2760 

crrccTCCcc tccaccgggc cccacgtgca cctagcgtcc ctoccagacx: cctgccagcc 2820 

65 AAGCCTGCAC TTAGGCAGGC CX^GGGGACX: TGTAAGCCAA A(XCCCCTCA GAAGCCTCTG 2880 
CCTGCAGATC CTCTGGCCAG AACAACTCGG CTCACTCATG CCTTGGCCAG GACCCCAGGA 2940 
CAATGGGAGA CTGGGCTCCG CCTGGCACCC CTCAGACCTG CTCCACAATA TCCACACCAA 3000 
GTGCCCAGAT OCACCCACAC CGCCTATATT AAGTGAGAAG (XGACAOCTT TTTTCAACAG 3060 
TGAAOACAGA AGTTTGCACT ATCTTTCAGC TCCAGTTGGA Ul l'J 1 1 IG'IA CCAACTTTTA 3120 

70 GGATrrnrr taatgtttaa aacatcatta ctataagaac tttgagctac tgccgtcagt 3iso 

GCTGTGCTGT GCTATGGTGC TCTGTCTACT TGCACAGGTA CTTGTAAATT ATTAATTTAT 3240 

gcagaatgtt gattacagtg cagtgogctg tagtaggcat ttttaccatc actg Aomrr 3300 

CCATGQCAGG AAGGCTTGTT G 1 UCIT 1 T A G TATTTTAGTO AACTT3AAAT ATOCTCCTTO 3360 
ATGGGATTCT GGACAGGATG llfi riXiClU CTGATCAAGG CCTTATTGGA AAGCAGTCCC 3420 
75 OCAACTACCC CCAGCTGTGC TTATGGTACC AGATGCAGCT CAAGAGATCC CAAGTAGAAT 3480 
CTCAGTTGAT TTTCTGG ATT CCCCATCTCA GGCCAG AGCC AAGGGGCTTC AGGTCCAGGC 3540 
TQTGTTTGGC TTTCAGGGAG GCCCTGTGCC CCTTGACAAC TGGCAGGCAG GCTCCCAGGG 3600 
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ACACCTGOGA OAAATCTGGC TTCTGOOCAO OAAGCTTTOO TGAOAACCTQ OGTTOCAQAC 3660 
AGGAATCTTA AGGTGTAGCC ACACCAGG AT AGAG ACTGGA ACACTAGACA AGCCAGAACT 3720 
TGACCCTGAG CTGACCAGCC GTGAGCATGT TTGG AAGGGG TCTGTAGTGT CACTCAAGGC 3780 
GGTGCTTGAT AGAAATGCCA AGCACTTCTT TTTCTCGCTG TCCTTTCTAG AGCACTGOCA 3840 
5 CCAGTAGGTT ATTTAGCTTG GGAAAGGTGG TGTTTCTGTA AGAAACCTAC TGCCGAGGCA 3900 
CTGCAAACCG CCACCTCOCT ATACTGCTTG GAGCTGAGCA AATCACCACA AACT GTAATA 3960 
CAATGATCCT GTATTCAGAC AGATGAGGAC TTTCCATGGG ACCACAACTA TTTTCAGATG 4020 
TGAACCATTA AOCAGATCTA GTCAATCAAG TCTGTITACT GCAAGGTTCA ACTTATTAAC 4080 
AATTAGGCAG ACTCTTTATG CTTGCAAAAA CTACAACCAA TGGAATGTGA TGTTCATGGG 4140 

1 0 TATAGTTCAT GTCTGCTATC ATTATTCGTA GATATTGGAC AAAGAACCTT CTCTATGGGG 4200 
CATCCTCTTT TTCfcAACTTG GCTGCAGGAA TCTTTAAAAG ATGCTTITAA CAGAGTCTGA 4260 
ACCTATTTCT TAAACACTTG CAACCTACCT GTTGAGCATC ACAGAATGTG ATAAGGAAAT 4320 
CAACTTGCTT ATCAACTTCC TAAATATTAT GAGATCTGGC TTGGGCAGCA TOCOCTTGAA 4380 
CTCTTCACrC TTCAAATGCC TGACTAGGGA GCCATGTITC ACAAGGTCTT TAAAGTGACT 4 440 

1 5 AATGGCATGA GAAATACAAA AATACTCAGA TAAGGTAAAA TOCCATG ATO CCTCTQTCTT 4500 
CTGG ACTGGT TTTCACATTA GAAGACAATT GACAACAGTT ACATAATTCA CTCTGAGTGT 4560 
TTTATGAGAA AGCCTTCTTT TGGGGTCAAC AGTTTTCCTA TGCTTTGAAA CAGAAA AATA 4620 
TGTACCAAGA ATCTTGGTTT GCCTTOCAGA AAACAAAACT GCATTTCACTTTCCOGGTOT 4680 
TCCCCACTGT ATCTAGGCAA CATAGTATTC ATGACTATGG ATAAACTAAA CACGTGACAC 4740 

20 AAACACACAC AAAAGGGAAC CCAGCTCTAA TACATTCCAA CTCGTATAGC ATGCATCTGT 4800 
TTATTCTATA GTTATTAAGT TCTTTAAAAT GTAAAGCCAT GCTGGAAAAT AATACTGCTG 4860 
AGATACATAC AGAATTACTG TAACTGATTA CACTTGGTAA TTGTACTAAA GCCA AACAT A 4920 
TATATACTAT TAAAAAGGTT TACAG AATTT TATGGTGCAT TACGTGGGCA TTGTCTTTTT 4980 
AG ATGCCCAA ATCCTTAGAT CTGGCATGTT AGCCCTTCCT CCAATTATAA GAGG ATATGA 5040 

25 ACGAAAAAAA AAAAAAAAAA AA 



30 



SeqIDNO: 10 Protean seocence; 
Protein Accession*: NP 003465.2 



1 11 21 31 41 51 

MAARPLPVSP ARALLLALAG ALLAPCEARG VSLWNEGRAD EWSASVRSG DLWIPVKSFD 60 
SKNHPHVLNI RLQRESKELi INLERNBGLI ASSFTETHYL QDGTDVSLAR NYTVILGHCY 120 

35 YHGHVRGYSD SAVSLSTCSG LRGUVFENE SYVLEPMKSA TNRYKLFPAK KLKSVRGSCG 180 
SHHNTFNLAA KNVFPPPSQT WARRHKRBTL KA1KYVELV1 V ADNREFQRQ GKDLEKVKQR 240 
UHANHVDK FYRPLNIRIV LVGVHVWNDM DKCSVSQDPF TSLHEFLDWR KMKLLPRKSH 300 
DNAQLVSGVY FQGTTIGMAP JMSMCTAPQS GGIVMDHSDN FLG AA VTLAH ELGHNFGMNH 360 
DILDRGGSCQ MAVEKGGC3M NASTGYFFPM VFSSCSRKDL ETSLEKGMGV CLFNLPEVRE 420 

40 SFGGQKCGNR FVEBGEECDC GEPEBCMNRC CNATTCTLKP DA VCAHGLCC EDCQLKPAGT 480 
ACRDSSNSCD LPEFCTQASP HCPANVYLHD GHSCQDVDGY CYNGICQTHB QQCVTXWGPG 540 
AKPAPGICFB RVNSAGDPYG NCGKYSKSSF AKCBMRDAKC GKIQCQGGAS RFVIGTMAVS 600 
IETNIPLQQG GRILCRGTHV YLGDDMPDPG LVLAGTKCAD GK1CLNRQCQ NKVFGVHEC 660 
AMQCHGRGVC MNRKNCHCEA HWAPPFCDKF GFGGSTDSGP IRQADNQGLT IGILVTILCL 720 

45 LAAGFWYLK RKTLIRLLFT NKKTTIEKLR CVRPSRPPRG FQPCQAHLGH LGKGLMRKPP 780 
DSYPPKDNPR RLLQCQNVDI SRPLNGLNVP QPQSTQRVLP PLHRAPRAPS VPARPLPAKP 840 
ALRQAQGTCK PNPPQKPLPA DPLAKTTRLT HALARTFGQW BTGLRLAPLR PAPQYPHQVP 900 
RSTHTAYDC £ 

50 SeqIDNO; II DNAseqoescc 

Nockic Acid Accession ft NM.001394 

Coding sequence: 40O-1584(uiute&«dscqueQCis eonespaod to 

1 11 21 31 41 51 

55 t l I i l l 

GGCACGAGGG CTGAGCGCCG GAGGAGCGTA GGCAGGGCAG CGCTGGCGCC AGCGGCGACA 60 
GGAGCCGCGC GACCGGCAAA AATACACGGG AGGCCGTCGC CGAAAAGAGT CCGCGGTCCT 120 
CTCTCGTAAA CACACTCTCC TCCACCGGCG CCTCCCCCTC CX5CTCTGCGC GCCGCCCQGC 180 

^ TGGGCGCCCG AGGCCGCTCC GACTGCTATG TQACCOCOAO GCTGCGGGAG GAAGGGGACA 240 

60 GGGAAGAAGA GGCTCTCCCG CGGGAGCCCT TGAGGACCAA GTTTGCGGCC ACTTCTQC AG 300 
GCGTCCCTTC TTAGCTCTCG CCTOCCCCTT TCTGCAGCCT AGGCGGCCCA GGTTCTCTTC 360 
TCTTCCTCGC GCGCCCA6CC GOCTCGGTTC CCXKKX3ACXATGGTGACGAT GGAGGAGCTG 420 
CGGG AGATGG ACTGCAGTGT GCTCAAAAGG CTG ATGAAOC GGG AGGAG AA TGGCGGCGGC 480 
GOGGGCGGCA GOGGCAGCCA CGGCACCCTC GGGCTGOCGA GCGGCGGCAA GTGOCTGCTG 540 

65 CTGGACTGCA GACCGTTCCT GGCGCACAGC GCGGGCTACA TCCTAGGTTC GGTCAACGTG 600 
CGCTGTAACA CCATCGTGCG GCGGCGGGCT AAGGGCTCCG TGAGCCTGGA GCAGATCCTG 660 
CCCGCCQAGG AGGAGOTACG CQCCCGCTTG OGCTCOGGCC TCTACTOGGC GGTCATOGTC 720 
TACGACG AGC GCAGCCCGCO CGCCGAGAGC CTCCGCGAGG ACAGCACCGT GTCGCTGGTG 780 
GTGCAGGCGC TGCGCCGCAA CGCCGAGCGC ACOGACATCT GCCTGCTCAA AGGCGGCTAT 840 

70 GAGAGGTTTT CCTCCGAGTA CCCAG AATTC TGTTCTAAAA CCAAGGCCCT GGCAGCCATC 900 
CCACCCCCGG TTCCCCCCAG CGCCACAGAG CCCTTGGACC TGGGCTGCAG CTCCTGTGGG 960 
AOOOCACTAC ACGACCAGGG GGGTCCTGTG G AG ATCCTTC CCTTCCTCTA CCTCGGCAGT 1020 
GCCTACCATG CTGCCCGGAG AGACATGCTG GACGCCCTGG GCATCACGGC TCTGTTG AAT 1080 
GTCTCCTCGG ACTGCCCAAA OCACnTGAA GGACACTATC AGTACAAGTG CATCCCAGTG 1140 

75 GAAGATAACC ACAAGGCCGA CATCAGCTCC TGGTTCATGG AAGCCATAGA GTACATOGAT 1200 
GCCGTGAAGO ACTGCCGTGG GCGCGTOCTG OTGCACTQCC AGOCGGQCAT CTCOC GGTCG 1260 
GOCACCATCT GCCTGGCCTA CCTGATGATG AAGAAACGGG TGAGGCTGGA GGAGGCCTTC 1320 
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GAGTTCGTTA AGCAGCGOCO CAGCATCATC TCGCCCAACT TCAGCTTCAT OOGGCAOCTG 1380 
CTOCAGTTCO AGTCCCAGGT GCTGGCCACG TCCTGTGCTG CGGAGGCTQC TAGCCCCTCG 1440 
GGACCCCTOC GGQAGCGGGG CAAGACCCCC GCCACCCCCA CCTCGCAGTT CGTCITCAGC 1500 
TTTCCGGTCT COGTGGGCGT GCACTCGGCC CCCAGCAGCC TGCCCTACCT GCACAGCCCC 1560 
ATCACCACCT CTCCCAGCTG TTAGAGCCGC CCTGGGGOCC CCAGAACCAG AGCIOGCTOC 1 620 
CAGCAAGGGT AOGACGGGCC GCATGCGGCA GAAAGTTGGG ACTGAGCAGC TGGGAGCAGG 1680 
CGACCGAGCT CCTTCCCCAT CATTTCTOCT TGGCCAACGA CX3AGGCCAGC CAGAATGGCA 1740 
ATAAGGACTC CGAATACATA ATAAAAGCAA ACAGAACACT CCA ACTTA G A GCAATAA CCO 1800 
GTGCCGCAGC AGCCAGGGAA GACCTTGGTT TGGTTTATGT GTCAGTTTCA CTTTTCCGAT 1860 
AGAAATTTCT TACCTCATTT TTTTAAGCAG TAAGGCTTGA AGTGATGAAA CCCACAG ATC 1920 
CTAGCAAATO TGCCCAACCA GCTTTACTAA AGGGGGAOGA AGGGAGGGCA AAGGGATGAG 1980 
AAGACAAGTT TCCCAG AAGT GCCTGGTTCT GGGTACTTGT CCCTTTGTTG TCGTTGTTGT 2040 
AGTTAAAGGA ATTTCATTTT TAAAAG AAAT CTTCGAAGGT GTGGTTTTCA TTTCTCAGTC 2100 
ACCAACAOAT GAATAATTAT GCTTAATAAT AAAGTATTTA TTAAGACTTT CTTCAGAGTA 2160 
TO AAAGTACA AAAAGTCTAG TTACAGTGGA TTTAGAATAT ATTTATGTTG ATGTCAAACA 2220 
GCTGAGCACC GTAGCATGCA GATGTCAAGG CAGTTAGGAA GTAAATGGTG TCTTGTAGAT 2280 
ATGTGCAAGO TAOCATO ATO AOCAACTTOA OTTTGTTGCC A CTGAOA AGC AGGCGGGTTG 2340 
GGTQGGAGGA GGAAGAAAGG GAAGAATTAG GTTTGAATTG CTTTTTAAAA AAA AAAGA AA 2400 
AG AAAAAGAC AGCATCTCAC TATGTTGCCA AGGCICATCT TGAGAAGCAG GCGGGTTGGG 2460 
TGGGAGGAGG AAGAAAGGGA AGAATTAGGT TTGAATTGCT i niiAAAAA AAAA 

SeqIDNO: 12 Protein seonencc: 
Protein Accession*: NP 001385 

1 U 21 31 41 51 

MVTMEELREM DCSVIjUlMN RDENGGGAGG SGSHGTLGLP SGGKCULLDC RPFLAHSAGY 60 
ILGSVNVRCN TIVRRRAKGS VSUEQILPAB EBVRARLRSG LYSAVIVYDB RSPRAESLRB 120 
DSTVSLWQA LRRNAERTDI CLLKGGYERF SSEYPBFCSK TKALAAIPPP VPPSATEPLD 180 
LGCSSCGTPL HDQGGPVEIL PFLYLGSAYH AARRDMLDAL GITAIXNVSS DCPNHFEGHY 240 
OYKCIPVBDN HKADISSWFM EAIEYIDAVK DCRGRVLVHC QAG1SRSATI CLAYLMMKKR 300 
VRLHEAFEFV KQRRSHSPN FSFMGQLLQF ESQVLATSCA AEAASPSGPL RERGKTPATP 360 
TSQFVFSFPV SVGVHSAPSS LPYLHSPTTT SPSC 



SeqIDNO: 13 DMAs 

hfadek Acid Accession fc none found 

Coding grp*™*? 63-340(uiKkrtined *c q ir n» CT correspond to start and stop codons) 

1 11 21 31 41 51 

AOCGOnTGC CTtCTCTT AO QCTTTQAAOC ATTI 11U1CT GTGCTCCCTG ATCTTCAGGT 60 
CACCACCATG AAGTTCTTAG CAGTCCTGGT ACTCTTGGGA GTTTCCATCT TTCTGGTCTC 120 
XGCCCAGAAT CCGACAACAO CTGCTCCAGC TGACAOGTAT CCAGCTACTG GTCCTGCTG A 180 
TG ATG AAGCC CCTGATGCTG AAACCACTGC TGCTGCAACC ACTGCGACCA CTGCTGCTCC 240 
TACCACTGCA ACCACCGCTG CTTCTACCAC TGCTCGTAAA GACATTOCAG TTTTACCCAA 300 
ATGGGTTGGG GATCTCCCGA ATGGTAGAGT GTGTCCCTGA GATGGAATCA GC TTGAOT CT 360 
TCTGCAATTO GGTCACAACT ATTCATGCTT CCTGTGATTT CATCCAACTA CTTACCTTGC 420 
CTACG ATATC CCCTTTATCT CTAATCAGTT TATTTTCTTT CAAATAAAAA ATAACTATGA 480 
GOGAGCTAAC AT 

SeqIDNO: 14 Piotcra sequence: 
Prob^ Accession S: none found 

1 11 21 31 41 51 

MKFLAVLVLL GVSIFLVSAQ NPTTAAPADT YPATGP ADDE APDAETT AAA TTATTAAPTT 60 
ATT AASTTAR KDIFVLPKWV GDLPNGRVCP 



SeqIDNO: 15 DNA s 

Nucleic Add Aixesstonff: NM 016640.2 

Coding sequence: 39-1358(imdciiined sequences correspond to start and stop codorts) 



1 II 21 31 41 51 

GCHTAAGTTG ACCTCTGGOT CCGG AATCGC GGGCAAAGAT GGCGGCGGCC AGGTGTTGGA 60 
GGCCTTTGCT ACGCGGTCCG AGGCTTTCAT TGCACACCGC GGCTAATGCC GCCGCCACGG 120 
CTACAGAAAC GACCTCCCAA GACGTCGCGG CGACCCCCGT CGCGCGGTAC CCGCCGATTG 180 
TGGCCTCCAT GACAGCCGAC AGCAAAGCTG CACGGCTGOG GCGGATCGAG CGCTGGCAGG 240 
CGACGGTGCA CGCTGCGGAG TOGGTAGACG AGAAGCTGCG AATCCTCACC AAGATGCAGT 300 
TTATGAAGTA CATGGTTTAC CCGCAGACCT TCGCACTGAA TGCCGACCGC TGGTACCAGT 360 
ACTTCACCAA GAOCGTGTTC CTGTCGGGTC TGCCGCCGCC CCCAGCGGAG CCCGAGCCOG 420 
AGCCCGAAOC CGAAOCTGAA CCTGCGCTGG ACCTCGCGGC GCTGCGTGCG GTCGCCTGCG 480 
ACTGCCTGCT GCAGGAGCAC TTCTACCTGC GGCQCAGGCG GCGCGTGCAC CGTTACGAGG 540 
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AGAGCGAGGT CATATCTTTG CCCTTOCTGG ATCAGCTGGT GTCAACOCTC GTGGGCCTOC 600 
TCAGCCCACA CAACCCGGCC CTGGCCGCTG CCGCCCTCGA TTATAGATGC CCAGTTCATT 660 
TTTACTGGGT GCGTGGTGAA GAAATTATTC CTCGTGGTCA TCGAAGAOOT CGAATTGATO 720 
ACTTGCGATA CCAGATAGAT GATAAACCAA ACAACCAGAT TCGAATATCC AAGCAACTCG 780 
5 CAGAGTTTGT GCCATTGGAT TATTCTGTTC CTATAGAAAT CCCCACTATA AAATGTAAAC 840 
CAGACAAACT TOCATTATTC AAACGGCAGT ATGAAAACCA CATATTTGTT GGCTCAAAAA 900 
CTGCAGATCC TTGCTGTTAC GGTCACACCC AGTTTCATCT CTTAC CTO AC AAATTAAGAA 960 
GGG AAAGGCT TTTGAG ACAA AACTGTGCTG ATCAGATAGA AGTTGTmT AGAGCTAATG 1020 
CTATTGCAAG CCTCTTTGCT TGGACTGGAG CACAAGCTAT GTATCAAGGA TTCTG GAGTG 1080 
10 AAGCAGATGT TACTCGACCT TTTGTCTCCC AGGCTGTGAT CACAGATGGA AAATACTTTT 1140 

c ci micr o ctaccaocta aatactttgg cactoactac acaagctgat caaaataacc 1200 

CTCGTAAAAA TATATGTTGG GGTACACAAA GTAAGCCTCT TTATGAAACA ATTGAGGATA 1260 
ATGATGTGAA AGGTTTTAAT GATG ATGTTC TACTTCAGAT AGTrCACTTT CT ACTGAATA 1320 
G ACCAAAAG A AGAAAAATCA CAGCrGTTGG AAAACTGAAA AAGCATATTT GATTGAGAAC 1380 

1 5 TGTGGGAATA TTTAAATTTT ACTGAAGGAA CAATAATGAT GAGATTTGTA ACTGTCAACT 1440 
ATTAAATACA TTGATTTTTG AGACAAATAT TTCTTATGTC AA CCTGTT AT TAG ATCTCTT 1500 
ACTCTGCTCA AATTCATCAC TGAAAGATTT AATTTTAGTT AOCTTTTGTT GATTTAAAAA 1560 
TAATTGCATT TGTATATTGC TAACTGATAA G ACAAATTGA GTTATTGAGC TATTAAATGC 1620 
ACATTTTAAT ATAAATGCAG AAATCCCAAA TAAAATGCTA ACATACTGAA TTCAGTAATT 1680 

20 AAAAGAACCCACTGC 



25 



40 



SeqIDNO: 1 6 Protein sequence; 
Protein Accession #: NP0S7724.1 

1 U 21 31 41 51 



MAAARCWRPL LRGPRLSLHT AANAAATATB TTSQDVAATP V ARYPPIVAS MTABSKAARL 60 
RRIERWQATV HAAESVDEKL RILTKMQFMK YMVYPQTFAL NADRWYQYFT KTVFLSGLPP 120 

30 PPAEPEPEPE PEPEPALDLA ALRAVACDCL LQEHFYLRRR RRVHRYEESE VTSLPFLDQL 180 
VSTLVGLLSP HNPALAAAAL DYRCPVHFYW VRGEEIIPRG HRRGRIDDLR YQIDDKFNNQ 240 
IRISKQLABP VFLDYSVPIB IKllKCXPDtC LPLPKRQYEN HIFVGSKTAD PCCYGHTQFH 300 
LLPDKLRRER LLRQNCADQ1 EWFRANAJA SLFAWTGAQA MYQGFWSKAD VTRPFVSQAV 360 
rTDGKYPSFF CYQLNTLALT TQADQNNPRK N1CWGTQSKP LYETIEDNDV KGFNDDVLLQ 420 

35 WHFLLNRPXEEKSQLLEN 



SeqIDNO: 1 7 DMA sequence 

NadefcAitf Accession*: NM_025059.1 __ j 

Coding sequence: 3-2150 (upderttned seqnpurrs correspond to start and stop codons) 



, 21 31 41 51 

GCATGAGCCT GGACTGCACC AGCCATATCG CGCTGGGTGC CGCTTCGCCA GCGCCCOAGG 60 
45 AAACTTACG A TCATCTTTCG GAAffTOCCGG TCACGCGGGA GCAOTTAAAC CACTATCGGA 120 
ATGTGGCTCA AAATGCTCGA AGTGAACTTG CAGCAACTTT GGTCAAATTT GAATGTGCTC 180 
AGTCTGAGCT TCAAGACCTC CGATCCAAGA TGCTTTCTAA AGAAGTCTCC TGTCAAGAAC 240 
TGAAAGCTOA AATGGAGAGC TACAAGGAAA ACAATGCCAG AAAATCATCTCTXXTTACCT 300 
CTTTGAGAG A CAOAGTTCAG GAACTAOAAG AAGAATCAGC AGCACTTTCC ACTTCTAAAA 360 
50 TCAGAACAGA AATCACAGCT CACGCTGCAA TCAAGGAGAA CCAGGAATTA AAGAAGAAAG 420 
TTGT AG AGTT AAATGAAAAA TTACAAAAGT GTTCAAAAGA AAATGAGOAG AATAAGAAAC 480 
. AAGTTTCAAA GAATTGCAGG AAACATGAGG AATTTCTGAC TCAACTOCGT GACTGCTTGG 540 
ATCCAGATGA GAQOAATGAC AAGQCATCAO ATGAAGATTT AATTTTAAAG CTTAGAGACC 600 
TGCGCAAAGA AAATG AATTC GTGAAAGGAC AAATTGTTAT TCTTGAAGAG ACTATAAATG 660 
55 TCCATGAGAT GGAAGCAAAA GCTAGCAGAG AAACGATCAT GAGGCIGGCT TCAGAAGTCA 720 
ACAG AOAGCA GAAAAAAGCT GCCTCCTGTA CTGAAGAG AA AGAGAAGCTG AACCAGGACC 780 
TOCTCAGTGC TGTAGAAGCA AAAGAAGCTC TTGAAAGGGA AGTTAAGArC TTCCAAGAAA 840 
GGCTGCTTGC TGGCCAGCAG GTCTGGGATG CCTCAAAGCA GGAAGTGAGC CTCCTGAAGA 900 
AAAGCTCTTC TG AGTTGGAG AAGAGTTTGA AGGCCAGTCA GGATGCAGTC ACAACCTCAC 960 
60 AAAGCCAGTA CTCCTCATTT AGGGAGAAAA TCGCAGCCCT CCTTAGGGGC AG ATTGAOCA 1020 
TG ACTGGGTC CACTGAGG AC ACCATTTTGG AGAAGATTCG AGAAATGGAC AGCCGGGAAG 1080 
AAAGCAGGGA CCGGATGGTC TCCCAGCTTG A AGCCCAAAT ATCTGAGCTT GTTGAACAGT 1140 
TGGGAAAGGA GTCTGGGTTT CACCAGAAAG CTCTCCAGAG GGCCCAGAAA GCAGAQAATA 1200 
TGTTGGAGAC TCTTCAGGGT CAGCTGACAC ACCTGGAGGC AGAGCTGGTT TCTGGAGGTG 1260 
65 TTTTGCG AGA CAACTTGAAT TTTGAGAAAC AAAAATATCT TAAAT TTCTG GATCAGCTTT 1320 
CTCAGAAAAT GAAGTTGGAC CAGATGGCTG CCGAACTTGG CTTTGACACG CGGCTGGACG 1380 
TGGTTTTAGC TCGAACAGAG CAGCTGGTTC GTCTTGAGAG CAATGCAGTC ATTGAGAACA 1440 
AGACCATTGC CCACAATTTG CAGAGAAAGC TAAAGACACA OAAAGAGAG A CTGGAGAGCA 1500 
AAGAATTACA CATOAGCCTC CTCCGOCAGA AAATAGCCCA GCTGGAGGAG GAOAAGCAOO 1560 
70 CACGCACGGC CTTGGTGGTT GAGAGGGACA ACGCGCATCT TACCATCAGG AACTTGCAGA 1670 
AGAAGGTGGA GAGGCTGCAG AAAGAGCTGA ACACGTGTCO AGACTTGCAC ACCGAGCTCA 1680 
AAGCCAAACT OGCCGACACC AATGAACTGA AGATTAAAAC TTTGGAACAG ACTAAAGCCA 1740 
TTGAAGATCT AAACAAATCC AGAGACCAAC TGGAGAAG AT GAAGGAGAAA GCTGAGAAAA 1800 
AGCTCATGTC TGTCAAGTCA GAACTGGATA CCACAGAACA TGAGGCTAAG GAG AATAAAG 1860 
75 AAAGGGCCAG AAACATGATA GAAGTGGTAA CCAGTGAAAT GAAGACACTA AAAAAATCTC 1920 
TGGAAGAAGC AGAAAAGAGA GAAAAGCAGC TGGCAG ACTT CAGGGAGGTG GTGTCGCAGA 1980 
TGCTAGGCTT GAACGTG ACC AGCCTTGCTC TTCCTGATTA TGAAATCATC AAGTGTCTTG 2040 
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AAAGATTGGT CX^TTCACAT CAGCATCACT TTGTTACCTG TGCCTGCCTC AAAGATGTGA 2100 
CTACTGGGCA AGAGAGGCAC CCACAAGGCC ATTTACAGCr TCTTCATXGA, ACACTQTATC 2160 
TCTTGAGAGA GGTGGCCATA AGACATGGCA CACAATTCCC AATTTCACAA ATTCCTCATG 2220 
TCTTTGAGAT TTGATCAGTT TGTGAATATT TTATGCTTTG ATGATATAGT GAGAATGCAT 2280 
5 CACITGCAAA AACGATCTCA AAAGTGTCAG CCTTAGATAA ACGTCAGCAT TAAAAAACGC 2340 
CAAAAAAAAA AAAAAAAAGC ATTTTAGGAT CCAGAAG AAT TCCACCAGAT TGCATG AGTT 2400 
AGATTGGGAA ATOGG AGTGG 0 AGATAATAT TGGGAGGTAT CTATTTTAAG TCAGGGGCTT 2460 
TACTAGCCG A TTTAGTTCTC ACAATAACCA TGTGGAG AAG CTGTGACATT TTTAATTTAC 2520 
AACCTTTCTG GOGCTCAGAC ATAAAGTTAC CTATCCAAGG TTGCAGTTGG GTAGTGGTGG 2580 

10 GACCAGGATG GACAACTCAT TGGCCCTGCC TCAAAAGCCA TACXTTCTTCT OCTGCTATGC 2640 
AGAATCTGTT TCTCCTGAAT CTCTGTGATG CTGGTGGGAA TTGTTTGCAT AGAGGAAGGA 2700 
CAATAACCCT GCCATCGTGA GTTAATGTCC GGGCTGGTCA CAGTGGTTCA TGCCTGTAAT 2760 
COCAGCACIT TGGGAGTCCA AGGCAGGCAT ATCATTTGAG GTCAGGAGTT TAAGACCAGC 2820 
CTOGCTAACA TAGTGAG ACC CTGTTTCTAC TAAAAATACA AAAATAAGCC AGGTGTGGTG 2880 

15 GTGCATGACT GTAATOCCAA CTACTCAGCA GGAGAAGCAC TTGAACCCAG GAGAOGGAGG 2940 
CGGCAGTGAG CCAAGATTGT GCCACTGCAC TCCAGCCTGG GCGACAGAGT GAAACTACAT 3000 
CTCAGAAAAA AAAAAAAAAA AAAAAAAAAA AAGTTAATGT OCAAAAATGA CAGATTTACA 3060 
AGTGTAAGCr ATATGATTTC TTCAAAAAGC AAAAGCAATA TAOCTAATTC ATTITOATCA 3 120 
AACTTACATA GGTCICAGGT CCTGTAAGAA ACTTGOCTGT TCTAACTOTT GCTACCAG AT 3180 

20 TATATCTGGT GGTAATTOTT AATGTTTCAG CAOGGCTOOT CTCVGTCOT TAA^TGG^A 3240 
AGCTTTATTT GGAGCOCAAC CCTATAAGAT GAAGAAATCC TATATAGTCT TATTCAOCAA 3300 
TATATCCAAT ACACCCACAG CAATGGTACC TTTTT AAGAT CAGGATTTT A TTATGAATTC 3360 
CTGTCACTTT CTGTTTTCCA TTTAAATTTC TATTTTACAA ATTTTTCAGG GAATCATATT 3420 
CTTAACTTCA CTGAG AAAAA TGTGTTACTC TTTTGGACAA TTTATCTTAT TTCTATCATA 3480 

25 TAAG ATGTAT TTTTTTATTG TCCTTAAAAG AAGCTCTAGC ATGAAATTAA AGGA^GGGA 3540 
AAGAAATAG A TCTGGTGCAC CCGAACATTA GGAOAAAATG AAAAATATAC AA0CAACO3T 3600 
TCGTG AGTCA TCAAAAAGTC AAAGTCAGCC TGGCCAACAT GGCAAAACTC OGTCTCTGCA 3660 
AAAAATACAA AAATGAGCCC GGTATGTTGG CATATGCCTG TAATCCCAGC TACTCGAGAG 3720 
GCTGAGACAC GAAAATTGCT TGAACCTGGG AGGCGGAGGT TGCAATGAGC CGAGATCGCG 3780 

30 CTACTGCACT CCAGCCTCGG CAACAGAGAG AG ACCTTGTC TCAAAAAACA ACAACAACAA 3840 
AAAGTCAAAG TCATAATAAG CAAATTATTG GCTTCTTTCT TCTAGACYAA AAGAAATTAA 3900 
AOAGATGAAA CAATCAATTG CAAGGGTCAA AACTAGATTG GATCTTGGTT TGAATGAAAA 3960 
AAAGCATAAA ATATTCTTGC AATAATTGTA AAAATTTGAA TGTGGACTAA GTCCTAGATT 4020 
ATATTAAAAT ATTTTTAATT TTTTAAGCTT GACAAATGCA CTGATTGTTA TACTTTAAAT 4080 

35 AACTAAAAAT CTGAGAATCC ACAGTGCTAC AGACAATAAA TGATAAAATG GGAAAAAAAA 4140 
AAAAAAAAAA A 

Sag ID NO: 18 Protein sequence; 
40 Protein Accession fc NPJ79335.1 

1 11 21 31 41 51 

MSUDCISHIA LGAASPAPEE TYDHLSEVPV TREQLNHYRN VAQNARSELA ATLVKFECAQ 60 
45 SELQDLRSKM LSKEVSCQEL KAEMESYKEN NARKSSLLTS LRDRVQELEE ESAALSTSKI 120 
RTHTAHAAI KENQELKKKV VELNEKLQKC SKENEENKKQ VSKNCRKHEB FLTQLRDCLD 180 
PDERNDKASD EDLILKLRDL RKENEFVKGQ lYILEKTINV HEMEAKASRE TIMRLASEVN 240 
REQKKAASCT EEKEKLNQDL LSA VBAKEAL EREVK3FQER IXAGQQVWDA SXQEVSLLKK 300 
SSSELEKSLK ASQDA VTTSQ SQYSSFREKJ AALLRGRLSM TGSTEDTILE KJREMDSR£E 360 
50 SRDRMVSQLB AQBELVEQL GKESGFHQKA LQRAQKAENM UBTLQGQLTH LEAELVSGGV 420 
LRDNLNFEKQ KYLKFLDQLS QKMKLDQMAA ELGFDTRLDV VLARTBQLVR LESNA VIENK 480 
TTAHNLQRKL KTQKERLESK ELHMSLLRQK IAQLEEEKQA RTAL WERDN AHLTIRMLQK 540 
KVERLQjKELN TCRDliTTELK AKIADTNELK IKTLEQTKAI EDLNKSRDQLEKMKEKAEKK 600 
LMSVKSELOTTEHEAKENKB RARNMIHVVT SEMKTLKKSL EEABKREKQL ADFRBWSOM 660 
55 LGLNVTSLAL PDYEHKCLE RLVHSHQHHF VTCACLKDVT TGQERHPQGH LQLLH 

Seq ID NO: 19 DNA sequence 

Nucleic Acid Accession AF071552, NM_000662 

60 Coding sequence: 441-1313 (undertfaxd sequences correspond to start and stop codona) 

1 11 21 31 41 51 

65 CTTTGTATAA GGCTCAGCTA AAAGGOAAAT TGAGTGGGTC AGGTACCACG GATACTATAC 60 
ACTCTATTGC ATGATTCTOC TGCCTACATC AGAAG ACGTT TATAAGCCTA TTTTAAAGGA 120 
TACCAGTTGG AATCTCTCTT TTATTAATCA CCAAGAGAAC CATGAACAAG CTGTTTATCA 180 
TTTGACTCAT CATTTAATCr TGATTTOCAG CTTCTCACAC TTGAAAGAAG ACATAATACA 240 
TTTCTCACAG QATTCTGGGA CTATTAACTG AACTTATOTQ TQTAAAAGGA ATTCATACAA 300 

70 TO AAAGCACT AGAAATAATT ATTATACTTA TAACCATTGT ATTTTTACAT GTTTAAAATA 360 
TAGCCATAAT TAGCCTACTC AAATCCAAGT GTAAAAGTAA AATGATTTGC I J 1 Ub J J J 1G 420 
TTTTCCTTGC TTAGGGGATC ATGGACATTG AAGCATATCT TGAAAGAATT GGCTATAAGA 480 
AGTCTAGGAA CAAATTGGAC TTGGAAACAT TAACTOACAT TCTTCAACAC CAGATOCGAG 540 
CTGTTCCCTT TGAGAACCTT AACATCCATT GTGGGGATOC CATGGACTTA GGCTTAGAGG 600 

75 CCATTTTTG A TCAAGTTGTG AGAAGAAATC GGGGTGG ATG GTGTCTOCAG GTCAATCATC 660 
TTCTGTACTG GGCTCTGACC ACTATTGGTT TTGAGACCAC GATGTTCGG A GGGTATOTTT 720 
ACAGCACTCC AGCCAAAAAA TACAGCACTG GCATGATTCA CCTTCTCCTG CAGGTGACCA 780 
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TTGATCGCAG GAACTACATT GTCGATGCTO GGTTTGGACG CTCATACCAO ATGTGGCAOC 840 
CTCTGOAOTT AATTTCTGGG AAGG ATCAGC CTCAOOTGCC nOTQTCTTC CGTTTG ACGG 900 
AAOAOAATGO ATTCTOQTAT CTAQACCAAA TCAOAAGGOA ACAGTACATT CCAAATGAAG 960 
AATTTCTTCA TTCTGATCTC CTAGAAGACA GCAAATACCG AAAAATCTAC TCCTTTACTC 1020 
5 TTAAGCCTCG AACAATTGAA GATTTTGAGT CTATGAATAC ATACCTGCAG ACATCTCCAT 1080 
C A ltTlUJUl l TACTAGTAAA TCA1 U Kill CCTTGCAGAC OCCAQATGGO QTTCACTGTr 1140 
TGGTGGOCTT CACCCTCACC CATAGG AGAT TCAATTATAA GGACAATACA GATCTAATAG 1200 
AGTTCAAGAC TCTGAGTGAG GAAGAAATAG AAAAAGTGCT GAAAAATATA TTTAATATTT 1260 
CCTTGCAGAG AAAGCTTGTG CCCAAACATG GTGATAGATT TTTTACTATT TAGA ATAAGG 1320 

10 AGTAAAACAA TCTTGTCTAT TTGTCATCCA OCTCAOCAGT TATCAACTGA CGACXTTATCA 1380 
TGTATCTTCT GTACCCTTAC CTTATTTTGA AGAAAATCCT AGACATCAAA TCATTTCACC 1440 
TATAAAAATG TCATCATATA TAATTAAACA GCTTTTTAAA GAAACATAAC CACAAACCIT 1500 
TTCAAATAAT AATAATAATA ATAATAATAA ATGTATTTTA AAGATGGCCT GTGGTTATCT 1560 

^ ^ TOGAAATTGG TGATTTATGC TAGAAAGCTT TTAATGTTGG TTTATTQTTQ AATTC 

Seq IDNO. 20 Protein sequence: 
Protein Accession fc NPJD00653. 1 



20 1 11 21 31 41 51 
I I I I I I 

MDffiAYLERl GYKKSRNKLD LETLTDILQH QIRAVFFENL KTHCGDAMDL GLEAIFDQW 60 
RRNRGGWCLQ VNHLLYWALT TKjFETTMJLG GYVYSTPAKK YSTGMMIXL QVTIDGKNY1 120 
VDAGPGRSYQ MWQPLELISG KDQfQVPCVF RLTEENGFWY LDQIRRBQY1 FNHEFLHSDL 180 
25 LEDSKYRKIY SFTLKPRTIE DFESMNTYLQ TSPSSEVFTSK SFCSLQTPDG VHCL VGFTLT 240 
HRRFNYKDNT DUEFKTLSE EHEKVLKNI FNISLQRKLV PKHGDRFFT1 

Seq IDNO. 21 DNA sequence 
Nuck*A(^ Accession #: NM 003714 

30 Cbdnig sequence: 123-1031 (underlined sequences correspond to start and stop cottons) 



1 11 21 31 41 51 

35 CGGCACGAGC AAAAAGGAAG AGTGGGAGGA GGAGGGGAAG CGGCGAAGGA GGAAGAGGAG 60 
GAGGAGGAAG AGGGGAGCAC AAAGGATCCA GGTCTCCCGA CGGGAGGTTA ATACCAAGAA 120 
CCATGTGTCC CGAGCGGCTG GGCCAGTTCA TGACCCTGGC TTTOGTCTTO GCCACCnTG 180 
ACCCGGCGCG GGGGACCGAC GCCAOCAACC CACOCGAGGG TOCOCAAGAC AGGAGCTCCC 240 
AGCAGAAAGO CXXKXTGTOC CTOCAGAATA CAGCGGAGAT CCAGCACTGT TTGGTCAACG 300 

40 CTGGCGATGT GGGGTGTGGC GTGTTTGAAT GTTTCGAGAA CAACTCTTGT GAGATTCGGG 360 
GCTTACATGG GATTTGCATG ACTTTTCTGC ACAACGCTGG AAAATTTGAT GOOCAGGGCA 420 
AGTCATTCAT CAAAGACGOC TTGAAATGTA AGGCCCACGC TCTGCGGCAC AGGTTCGGCT 480 
GCATAAGCCG GAAGTGCCCG GCCATCAGGG AAATGGTGTC CCAGTTGCAG CGGG AATGCT 540 
ACCTCAAGCA CX3ACCTGTGC GCGGCTGCCC AGGAGAACAC CCGGGTGATA GTGGAGATGA 600 

45 TCCATTTCAA GGACTTGCTG CTGCAOGAAC CCTACGTGGA CCTOGTGAAC TTGCTGCTG A 660 
CCTGTGGGGA GGAGGTGAAG GAGGCCATCA CCCACAGCGT GCAGGTTCAG TGTGAGCAGA 720 
ACTGGGGAAG OCTGTGCTOC ATCTTGAGCT TCIGGACCTC GGOCATCCAG AAGOCTOOCA 780 
CGGCGCCCCC CG AGCGCCAG CCCCAGGTOG ACAGAACCAA GCTCTCCAGG GCCCACCACG 840 
GGGAAGCAGG ACATCACCTC CCAGAGCCCA GCAGTAGGGA GACTGGCCG A GGTGCCAAGG 900 

50 OTGAGCGAGG TAGCAAGAGC CACCCAAACG CCCATGCCOG AGGCAGAGTC GGGGGCCTTG 960 
GGGCTCAGGG ACCTTCCGGA AGCAGCGAGT GGGAAGACGA ACAGTCTGAG TATTCTGATA 1020 
TCCGG AGGTGAAATG AAAGG CCTGGCCACG AAATCTTTOC TCCACGCCGT CCATTTTCTT 1080 
ATCTATOGAC ATTCCAAAAC ATTTACCATT AOAGAGGGGG GATGTCACAC GCAGGATTCT 1 140 
GTT3GGGACTG TGGACTTCAT CGAGGTGTOT GTTOOOOGAA OGGACAGGTG AGATGGAGAC 1200 

55 CCCTGGGGCC GTGGGGTCTC AGGGGTGCCT GGTGAATTCT GCACTTACAC GTACTCAAGO 1260 
GAGCGCGCCC GCGTTATCCT CGTACCTTTG TCTTCTTTCC ATCTGTGG AG TCAGTGGGTG 1320 
TCGGCCGCTC TGTTGTGGGG GAGGTG AAOC AGGGAGGGGC AGGGCAAGGC AGGGOOOOCA 1380 
GAGCTGGGCC ACACAGTGGG TGCTGGGOCT CGCCCCGAAG CTTCTGGTGC AGCAGCCTCT 1440 
GGTGCTGTCT COGOGG AAGT CAGGGOGGCT GO ATTCCAGG ACAGG AGTGA ATGTAAAAAT 1500 

60 AAATATCGCT TAGAATGCAG G AG AAGGGTG GAG AGGAGGC AGGGGCOGAG GGGGTGCTTG 1560 
QTGOCAAACT GAAATTCAGT TTCTTGTGTG G GGCCTT GCG GTTCAGAGCT CTTGGCQAGG 1620 
GTGGAGGGAG GAGTGTCATT TCTATGTGTA ATTTCTGAGC CATTGTACTG TCTGGGCTGG 1680 
GGGGGACACT GTCCAAGGGA GTGGCOCCTA TOAGTTTATA TTTTAACCAC TGCTTCAAAT 1740 
CTCGATTTCA CI 11111 1 AT TTATCCAGTT ATATCTACAT ATCTGTCATC T AAAT AAATG 1800 

65 GCTTTCAAAC AAAAAAAAAA AAAAAAAAAA AAAAAAA 



70 



f>q m NO: 22 Protein sequence: 
protein Accession ft NPJD03705 



1 11 21 31 41 51 
I I I t I t 

MCAERLGQFM TLALVLATFD PARGTDATNP PEGPQDRSSQ QKGRLSLQNT AHQHCLVNA 60 
GDVGCG VFEC FENNSCBRG 1HGICMTFLH NAGKFDAQGX SHKDA1KCK AHALRHRFGC 120 
75 ISRKCPAIRE MYSQLQRECY LKHDLCAAAQ HNTRVIVEMI HFKDLLLHEP YVDLVNUXT 180 
CGEEVKEAIT HSVQVQCEQN WGSLCSILSF CTSAlQKPPT APPERQPQVD RTKLSRAHHG 240 
EAGHHLPEPS SRETGRGAKG HRGSKSHFNA HARGRVGGLG AQGPSGSSBW EDEQSEYSDI 300 
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Seq ID NO: 23 DNA sequence 

Nucleic Acid Accession ft NMJ005264.1 

Coding sequence: 557-1 954(imdcrtined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

10 1 | I I I | 

GAATTCCGGC CAOAAGAAAT CTGGCCTCGG AACACGOCAT TCTCCGCGCC GCTTCCAATA 60 
AOCACTAACA TCCCTAACGA GCATCCGAGC CGAGGGCTCT OCTCGGAAAT OGTCCT GGOC 120 
CAACTCGGCC CTTCGAGCTC TCG AAGATTA CCGCATCTAT TTTTTTTTTC TTTTTTTTCT 180 
TTTCCTAGOG CAGATAAAGT GAGCCCGGAA AGGGAAGGAG GGGGCGGGGA CACCATTGCC 240 

15 CTGAAAGAAT AAATAAGTAA ATAAACAAAC TGGCTCCTCG CCGCAGCTGG ACXXX5GTCGG 300 
TTG AGTCCAG GTTGGGTCGG ACXTTGAACCC CTAAAAGCGG AACCGCCTCC CQCC CTCG CC 360 
ATCCCGGAGC TGAGTOGOCG GCGGCGGTGG CTGCTGCCAG ACCCGGAGTT TCCTCITTCA 420 
CTGGATGGAG CTGAACTTTG GGCGGCCAGA GCAGCACAGC TGTCCGGGGA TCGCTGCACG 480 
CTGAGCTCCC TCGGCAAGAC CCAGCGGCGG CTCGGG ATTT TTTTGGGGGG GCGGGGACCA 540 

20 GCCCCGCGCC GGCACCATGT TOCTGGCGAC CCTGTACTTC GCGCTGCCGC TCTTGGACTT 600 
GCTCCTGTOG GOOGAAGTGA GCGGCGGAGA CCGCCTGGAT TGCGTGAAAG CCAGTGATCA 660 
GTGCCTGAAG GAGCAGAGCT GCAGCACCAA GTACCGCACG CTAAGGCAGT GCGTGGCGGG 720 
CAAGGAGACC AACTTCAGCC TGGCATCCGG CCTGGAGGCC AAGGATGAGT GCCGCAGCGC 780 
CATGGAGGCC CTGAAGCAGA AGTOGCTCTA CAACTGCCGC TGCAAGCGGG GTATGAAGAA 840 

25 GGAGAAGAAC TGCCTGCGCA TTTACTGGAG CATGTACCAG AGCCTGCAGG GAAATGATCT 900 
GCTGGAGGAT TCCCCATATG AACCAGTTAA CAGCAGATTG TCAGATATAT TCCGGGTGGT 960 
CCCATTCATA TCAGATGTTT TTCAGCAAGT GGAGCACATT CCCAAAGGGA ACAACTGCCT 1020 
GGATGCAGCG AAGGCCTGCA ACCTCGACGA CATTTGCAAG AAGTACAGGT CGGCGTACAT 1080 
CAOCCCGTGC ACCACCAGCG TGTCCAACGA TGTCTGCAAC CGCCGCAAGT GCCACAAGGC 1140 

30 CCTCCGGCAG TTCTTTGACA AGGTCCCGGC CAAGCACAGC TACGGAATGC TCTTCTGCTC 1200 
CTGCCGGGAC ATCGCCTGCA CAGAGCGGAG OCQACAGACC ATCGTGCCTG TCTGCTCCTA 1260 
TGAAG AGAGG GAGAAGCCCA ACTGTTTGAA TTTGCAGGAC TCCTGCAAGA CGAATTACAT 1320 
CTGCAGATCT CGCCITGCGG ATTTTTTTAC CAACTGCCAG CCAGAGTCAA GGTCTGTCAG 1380 
CAGCTGTCTA AAGGAAAACT ACGCTGACTG CCTCCTCGCC TACTCGGGGC TTATTGGCAC 1440 

35 AOTCATGACC CCCAACTACA TAGACTCCAG TAGCCTCAGT GTGGCCCCAT GGTGTGACTG 1500 
CAGCAACAGT GGGAACG ACC TAGAAGAGTG CTTGAAATTT TTGAATTTCT TCAAGGACAA 1560 
TACATGTCTT AAAAATGCAA TTCAAGCCTT TGGCAATGGC TCCGATGTG A CCGTGTGGCA 1620 
GCCAGCCTTC CCAOTACAQA CCACCACTGC CACTACCACC ACTGCOCTCC GGGTTAAGAA 1680 
CAAGCOCCTG GGGOCAGCAG GGTCTG AGAA TGAAATTCGC ACTCATGTTT TCCCACCGTG 1740 

40 TGCAAATTTA CAGGCACAGA AGCTGAAATC CAATGTGTCG GGCAATACAC ACCTCTGTAT 1800 
TTCCAATGGT AATTATG AAA AAGAAGGTCT CGGTGCTTOC AGCCACATAA CCACAAAATC 1860 
AATGGCTGCT CCTCCAAGCT GTGGTCTGAG CCCACTGCTG GTCCTGGTGG TAACCGCTCT 1920 
GTCCACCCTA TTATCTTTAA CAGAAACATC ATAGCTGCAT TAAAAAAATA CAATATGGAC 1980 
ATGTAAAAAG ACAAAAACCA AGTTATCTGT TTCCTGTTCT CTTGTATAGC TOAAATTCCA 2040 

45 GTTTAGGAGC TCAGTTGAGA AACAGTTOCA TTCAACTGGA ACA1 1 1 I'll i 1 1 1 ICCl 1 1 1 2100 
AAGAAAGCTT CTTGTGATCC TTCGGGGCTT CTOTGAAAAA CC IGATG CAG TGCTCCATCC 2160 
AAACTCAGAA GGCTTTGGGA TATGCTGTAT TTTAAAGGGA CAGTTTGTAA CTTGGGCTGT 2220 
AAAGCAAACT GGGGCTGTGT TTTCGATGAT GATGATCATC ATGATCATGA TGATTTTAAC 2280 
AGTTTTACTT CTGGGCTTTC CTAGCTAGAG AAGGAGTT AA TATTTCTAAG GTAACTCCCA 2340 

50 TATCTCCTTT AATGACATTG ATTTCTAATG ATATAAATTT CAGOCTACAT TGATGCCAAG 2400 
CT1TI 1 VJC C ACAAAGAAGA TTCTTAOCAA GAGTGGGCTT T0T0O AAACA GCTGQTACTQ 2460 
ATGTTCACCT TTATATATGT ACTAGCATTT TCCACGCTGA TGTTTATGTA CTGTAAACAG 2520 
TTCTGCACTC TTGTACAAAA GAAAAAACCA CCCGGAATTC 

55 Seq ID NO: 24 Protein sequcocet 

Protein Accession #: NP_0O5255 

1 11 21 31 41 51 

60 MFLATLYFAL PLLDLLLSAE VSGGDRLDCV KASDQCLKEQ SCSTKYRTLR QCVAGKEThff 60 

SLASGLEAKD ECRSAMEALK QKSLYNCRCK RGMKKEKNCL RIYWSMYQSL OGNDLLEDSP 120 
YEPVMSRLSD IFRWPFISD VFQQVEHIPK GNNCLDAAXA C NLDD ICKKY RSAY1TPCTT 180 
SVSND VCNRR KCHKALRQFF DKVPAKH5YG MLFCSCRDIA CTERRRQTIV PVCSYEEREK 240 
PKCLNLQDSC KTNYICRSRL ADFFINCQPE SRSVSSCLKE NYADCULAYS GLIGTVMTPN 300 

65 YIDSSSLSVA PWCDCSNSGN DLEECLKFLN FFKDNTCLKN AIQAFGNGSD VTVWQPAFPV 360 
QTTTATTTTA LRVKNKPLGP AGSBNHPTH VLPPCANLQA OKLK SNVSGN THLOSNGNY 420 
EKEGLGASSH ITTKSMAAPP SCGLSPLLVL WTALSTLLS LTBTS 

Seq D0NO: 25 DNA sequence 
70 Nucleic Acid Accession #: FGENBSH predicted 

Coding sequence: I576(enb>e sequence Teprrsents open reading frame) 

] 11 21 31 41 51 

75 CTrrTGTITC GCCATGCCTA GTCTAGTGGT ATCTGGAATA ATGG AAAGAA ATGGGGGCTT 60 

TOGAGAACTA GGATGTTTCG GGGGAAGCGC TAAGGACOGA GGGCTGCTGG AAGAOGAGCG 120 
CGCCCTTCAO CTGGCTCTCG ATCAACTCTG CXrTCCTGGGT TTGGGGGAGC CCCCCGCCCC 180 
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CAGGGCGGGC GAGGACGGGG G AGGTGGGGG GGOCGGCGCC CCCGCGCAGC CGACAOCCCC 240 
CCCGCAGCCG GCGCCGCCGC ttXXGCCCGC GQCGCCCCCG GCCGCCCCGA raACGCCCCC 300 
CGCAGCGCAG ACGCCCCAGC COCCCACCGC CCCCAAAGGG GCGAGOGACG CCAAGCTCTG 360 
CGCTCTCTAC AAAGAGGCCG AGCTGCGCCT GAAGGGCAGC AGCAACACCA CGGAGTGTGT 420 
5 TCCCGTGCCC ACCTCCGAGC ACGTGGCCGA GATCGTGGGC AGGCAAGGCT GCAAGATTAA 480 
GGCCTTGAGG GCCAAGACCA ACACCTACAT CAAGACACCG GTGAGGGGCG AGGAACCAGT 540 
GTTCATGGTG ACAGGGCGAC GGGAGGACGT GGCCACAGCC CGGCGGGAAA TCATC TCAGC 600 
AGCGGAGCAC TTCTCCATGA TCCGTGCCTC CCGCAACAAG TCAGGCGCCG CCTTTGGTGT 660 
GGCTCCTGCT CTGCCCGGCC AGGTGACCAT CCGTGTGCGG GTGCXXTACC GCGTGGTGGG 720 
10 GCTGGTGGTG GGCCCCAAAG GGGCAACCAT CAAGCGCATC CAGCAGCAAA CCAACACATA 780 
CATTATCACA CCAAGCCGTG ACCGCGACCC CGTGTTCGAG ATCACGGGTG CCCCAGGCAA 840 
CX5TGGAGCGT GCGCGCGAGG AGATCGAGAC GCACATCGCG GTGCGCACTG GCAAGATCCT 900 
CGAGTACAAC AATGAAAACG ACTTCCTGGC GGGGAGCCCC GACGCAGCAA TCGATAGCCG 960 
CTACTCCGAC GCCTGGCGGG TGCAOCAGCC CGGCTGCAAG OCCCTCTCCA CCTTCCGGCA 1020 
15 GAACAGCCTG GGCTGCATCG GCG AGTGCGG AGTGGACTCT GGCTTTGAGG CCCCACGCCT 1080 
GGGTGAGCAG GGCGGGGACT TTGGCTACGG CGGGTACCTC TTTCCGGGCT ATGGCGTGGG 1140 
CAAGCAOOAT OTGTACTACG GCGTGGCCGA GACTAGCCCC CCGCTGTGGG CGGGCCAGGA 1200 
GAACGCCACG CC£ACCTCCG TGCTCTTCTC CIXJTGCCTXXZTCCTCCTOCT CCTCTTOCGC 1260 
CAAGGCCCGC GCTOGGCCCC CGGOCOCACA CCGCTCCCCT GCCACTTCCO CGGGACCCGA 1320 
20 GCTGGCCGGA CTCCCGAGGC GCCCCCCGGG AG AGCCGCTC CAGGGCTTCT CTAAACTTGG 1380 
TOGGGGCOGC CTGCGGAGCC CCGGCGGCGG GCGGGATTGC ATGGTCTGCT TTGAGAGCGA 1440 
AGTG ACTGCC GCCCTTCTGC CCTGCGGACA CAACCTGTTC TGCATOG AGT GTGCAGTAOG 1500 
CATCTGCGAG AGGACGGACC CAGAGTGTCC COTCTGOCAC ATCACAGCCA CGCAAGCCAT 1560 
CCGAATATTC TCCTAA 



25 



Scq ID NO: 26 Protein 



Piotem Accession ft FGENESH predicted 



30 1 11 21 31 41 51 

FCFAMPSLVV SGIMERNGGF GELGCFGGSA KDRGLLEDER ALQLALDQLC LLGIXjBFPAP 60 
RAGEDGGGGG GGAPAQPTAP PQPAPPPPPA APPAAPTTAP AAQTPQPPTA PKGASDAKLC 120 
ALYKEAELRL KGSSNTTECV PVPTSEHVAE IVGRQGCKIK ALRAKTNTYI KTPVRGEEPV 180 

35 FMVTGRREDV ATARREUSA AEHPSMIRAS KNKSGAAPGV APA LPGQV TI RVRVPYRWG 240 
LWGPKGATI KRIQQOTNTY OTPSRDRDP VFE1TGAPGN VERAREEIET HIAVKTGKIL 300 
EYNNENDFLA GSPDAA1DSR YSDAWRVHQP OCKPLSTFRQ NSLGCIGECG VPSGFEAPRL 360 
GEQGGDFGYG GYLFPG YGVG KQDVYYGVAE TSPPLWAGQE NATPTSVLFS SASSSSSSSA 420 
KARAGPPGAH RSPATSAGPE LAGLPRRPPG EPLQGFSKLG GGGLRSPGGG RDCMVCFESE 480 

40 VTAALVPCGH NLFCMECA VR ICERTDPECP VCfflTATQAJ RIPS 



45 



SeqIDNO:27DNAt-^ 

Nucleic Add Accession ft FGENESH predicted ^ 

Coding sequence: l _2070 (underfilled sequences conespond to 



1 11 21 31 41 51 

ATOAGCGGTG COGGGGTGGC GGCTGGGACG CGGCCCCCCA GCTCGCCGAC CCCGGGCTCT 60 
CGGCGCOGGCG<X^GOXXX:CTCTOTXKK3^ 120 

50 CTCAGGCAGA GCGACCCGCA GAAACGGAAC CTGGACCTGG AGAAAAGCCT GCACHTCCTG 180 
CAGCAGCAGC ACTCGGAGAT GCTGOCCAAG CTCCATG AGG AGATCGAGCA TCTGAAGCGG 240 
GAAAACAAGG GTGAGCCGGC GCGGGGCCCT AGGCCGGCCC TGCCTCCCCA GGCACACTCA 300 
ACACTGCCGC TCCCGCAGCA CAGAAACACA GCCATCAACT CCAGCACACG CCTGGGCTCA 360 
GGGGGAACAC AGGACGGGG A GCCCCTCCAG ACTGTCCTTG CCCACCTGGC TGCACTGGCC 420 

55 CCTOTATOCC AACCCAOTOO OTACAGGTTC TGGGGG ACCT GGACAOATGC CGCTACCTCT 480 
AGCCGTGGCT GGACGATGTT ATGCAGCCAA GCACAGCACQ TGCTGCTCTC OGGAAGCCCA 540 
GGGCCTGAGG TCATTGCAGG GCGGCAGGTG GCCACAGGGT GCTCCCCAGA CCTC CCTCCT 600 
CCAAGTAOAQ CTOAAATGGO AAGOAACCCC TGGGACAGCC CCTGCCCTGC TAGATCTTTQ 660 
CCTCAGATTG CTGCTGTGGC CAGGCCCAGG ATTTCCAGCC CTATGGCTCT GAGTCCTCAC 720 

60 ATGCTGGGGO CCCAGGGGAT ATGGACACAC TCCATCCAGQ OATCCCTTCC TGCCATCTGG 780 
GCAGCAACCA TGGGGACAAA GGGAGGAAGC AGAGTCCTGT TTCCTTGCCA CTTGTOCAAG 840 
GCACTTCCCC ATCCTGACAQ CGGCOCCCAC CCAGCCCAGG ATCCTGGGCT GTGGTCTCAA 900 
GCTCACTTCC CATTATCTTT GGGGCTGGGG CTGACATCAG QAOG ACATCT^CTOQ TOGA 960 
TGGAGCCAGC CTGGGAACAT CQCAGCTGGG GCAGTGCCTA GGGCTCTCCC TTOCCAGGGA 1020 

65 G ACATGGAG A AGGGGGTTOA GGGAGGGCCC TTCCCTAGCC GCTGTGGCAA CTCCAGTGAG 1080 
CTGTTCTGGO CAAAGTGTGG CCCAAGTCGG CAGCCCCAGC OTTjCAGTGC TGGGGACGCT 1140 
GACAGGACAC GGGAAGAGGC CATGCTTTCC CTCGGGACCT GCTGTTCCAT GTGTCCCAAG 1200 
CCCTCCTOCT TTCCAGATGG CCCCTCAGGA AACCACCTIT CCAGGGCCTC TGCTCCCTTG 1260 
GOCGCTCGCT GGGTCTGCAT CAACGGAGTG TGGGTAGAGC CGGGAGGACC C^GOOCTGCC 1320 

70 AGGCTGAAGG AGGGCTCCTC ACGGACACAC AGGCCAGGAG GCAAGCGTGG GCuiunuCG 1380 
GGCGGTAGCG CCGACACTGT GCGCTCTCCT GCAGACAGCC TCTCCATGTC AAGCTTCCAG 1440 
TCTGTCAAGT CCATCTCTAA TTCAGOCAAC TCTCAAGGCA AGGCCAGGCC CCAGCCCGGC 1500 
TCCTTCAACA AGCAAGATTC AAAAGCTG AC GTCTCCCAGA AGGCGGACCT GGAAGAGGAG 1560 
CCCCTACTTC ACAACAGCAA GCTGOACAAA GTTCCTGGGG TACAAGGGCA GGCCAGAAAG 1620 

75 GAGAAAGCAG AGGCCTCTAA TGCAGGAGCT GCCTGTATGG GGAACAGCCA GCACCAGGGC 1680 
AGGCAGATGG GGGCGGGGGC ACACCCCCCA ATGATCCTOC OCCTTCCCCT GCOAAAGCCC 1740 
ACCACACTTA GGCAGTGCGA AGTGCTCATC CGCOAGCTGT GGAATACCAA CCTCCTGCAG 1800 
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ACCCAAOAGC TGCGGCACCT CAAGTCCCTC CTGGAAGGGA GCCAGAGGCC CCAGGCAGCC I860 
CCGGAGGAAG CTAGCTTTCC CAGGGACCAA GAAGCCACOC ATTTCCCCAA GGTCTCCACC 1920 
AAGAGCCTCT CCAAGAAATG CCTGAGCCCA CCTGTGGCGG AGCGTGCCAT CCTGCCCGCA 1980 
CTGAAGCAGA CCCCGAAGAA CAA CTTTG CC GAGAGGCAGA AGAGGCTGCA GGCAATGCAG 2040 
5 AAACGGCGCC TGCATCGCTC AGTGCTTISA 

Seg ID NO: 28 Protein sequence: 

Protein Accession ft FGENESH predicted 

10 

1 11 21 31 41 51 

MSGAGV AAGT RPPSSPTFGS RRRRQRPSVG VQSLRPQSPQ LRQSDPQKRN LDLEKSLQFL GO 
QOOHSEMIAK LHEED2HLKR ENKGEPARGP RPALPPQAHS TLPLPQHRNT AINSSTRLGS 120 

15 GGTTOGEPLQ TVLAHLAAJLA PVCQPSGYRF WGTWTDAATS SRCWTMLCSQ AQHVLLSGSP 180 
GPEVIAGRQV ATOCSPDLPP PSRAEMGRNP WDSPCPARSL PQ1AAVARPR KSPMALSPH 240 
MLGAQGIWTH SIQGSLPAIW AATMGTKGGS RVLFPCHLSK ALPHPDSGPH PAQDPGLWSQ 300 
AHFPLSLGLG LTSGGHLTGG WSQPGNIAAG A VPRALFSQG DMEKQVEGGP FPSRCGNSSP 360 
LFWAKCGPSR QPQPCSAGDA DRTREEAMLS LGTCCSMCPK PSCFPDGPSG NHLSRASAPL 420 

20 GARWVONG V WVEPGGPSPA RLKEGSSRTH RFGGXRGRLA GGSADTVRSP ADSLSMSSFQ 480 
SVKSXS^SAN SQGKARPQPG SFNKQDSKAD VSQKADLEEE PLLHNSKLDK VPGVQGQARK 540 
EKAEASNAGA ACMGNSQHQG RQMGAGAHPP MILPLPLRKP TTLRQCEVU RELWNTNLLQ 600 
TQHLRHLKSL LEGSQRPQAA PEEASFPRDQ EATHFPKVST KSLSOCCLSP PVABRAILPA 660 

^ UCQTPKNNFA ERQKRLQAMQ KRRLHRSVL 

Seq ID NO: 29 DNA sequence 

Noclcic Add Accession ft NM_0 123 19.2 

Coding sequence: 138-2405 (underlined sequences correspond to start and stop codons) 

30 

1 ]| 21 31 41 51 

CTCGTGCCGA ATTCGGCACG AGAOCGCGTG TTCGCGCCTG GTAGAGATTT CTCGAAGACA 60 
CCAGTGGGCC CGTGTGGAAC OVAACCTGCG CGCGTGQCCG GOCCGTGGGA C AACGAG GCC 120 

35 GCGGAGACGA AGGCGCAATG GCGAGG AAGT TATCTGTAAT CTTGATCCTG ACCHTTGCCC 180 
TCTCTOTCAC AAATOCOCTT CATGAACTAA AAGCAGCTGC TTT000CCAG AOCACTGAOA 240 
AAATTAGTCC GAATTGGGAA TCTGGCATTA ATGTTGACTT GGCAATTTCC ACACGGCAAT 300 
ATCATCTACA ACAGCTTTTC TACCGCTATG GAGAAAATAA TTCTTTGTCA GTTG AAGGGT 360 
TCAGAAAATT ACTTCAAAAT ATAGGCATAG ATAAGATTAA AAGAATCCAT ATACACCATG 420 

40 ACCACG ACCA TCACTCAGAC CACGAGCATC ACTCAGACCA TGAGCGTCAC TCAGACCATG 480 
AGCATCACTC AG ACCACG AG CATCACTCTG ACCATGATCA TCACTCTCAC CATAATCATG 540 
CTGCTTCTGG TAAAAATAAG CGAAAAGCTC TTTGCCCAGA CCATGACTCA GATAGTTCAG 600 
GTAAAGATCC TAGAAACAGC CAGGGG AAAG GAGCTCACCG ACCAGAACAT GCCAGTGGTA 660 
GAAGGAATGT CAAGGACAGT GTTAGTGCTA GTG AAGTGAC CTCAACTGTG TACAACACTG 720 

45 TCTCTGAAGG AACTCACTTT CTAGAGACAA TAGAOACTCC AAGACCTGGA AAACTCTTOC 780 
CCAAAGATGT AAGCAGCTCC ACTOCACCCA GTGTCACATC AAAG AGCCGG GTG AGCCGGC 840 
TGGCTOGTAG GAAAACAAAT GAATCTGTGA GTGAGCCCOG AAAAGGCTTT ATGTATTCCA 900 
OAAACACAAA TGAAAATCCT CAGGAGTGTT TCAATGCATC AAAGCTACTO ACATCTCATG 960 
GCATGGGCAT CCAGGTTCCG CTGAATGCAA CAGAGTTCAA CTATCTCTGT CCAGCCATCA 1020 

50 TCAACCAAAT TO ATGCTAGA TCTTGTCTGA TTCATACAAG TGAAAAGAAG GCTGA AATCC 1080 
CTCCAAAGAC CTATTCATTA CAAATAGCCT GGGTTGGTGG TTTTATAGCC ATTTCCATC A 1140 
TC^VGTTTCCTlilClClGCT G GGOGTTATCT T AGTGCCTCT CATOAATCGG ow n fTTCA 1200 
AATTTCTCCT GAGTTTCCTT GTGGCACTGG CCOTTGGGAC TTTGAGTGGT GATGCTTTTT 1260 
TACACCTTCT TCCACATTCT CATGCAAGTC ACCACCATAG TCATAGCCAT OAAGAACCAG 1320 

55 CAATGGAAAT GAAAAGAGGA CCACTTTTCA GTCATCTGTC TTCTCAAAAC AT AGAAGAAA 1380 
GTGCCTATTT TGATTCCACG TGGAAGGGTC TAACAGCTCT AGGAGGCCTG TATTTCATGT 1440 
TH 'ITOTTCA ACATGTCCTC ACATTGATCA AACAATTTAA AG ATAAGAAO AAAAAOAATC 1500 
AGAAGAAACC TGAAAATG AT GATG ATGTGG AGATTAAGAA GCAGTTGTCC AAGTATG AAT 1560 
CTCAACTTTC AACAAATGAG GAGAAAGTAG ATACAGATGA TCGAACTGAA GGCTATTTAC 1620 

60 GAGCAGACTC ACAAG AGCCC TCCCACTTTG ATTCTCAGCA GCCTGCAGTC TTGG AAGAAG 1680 
AAGAGOTCAT GATAGCTCAT GCTCATCCAC AGGAAGTCTA CAATGAATAT GTACCCAGAG 1740 
GGTGCAAGAA TAAATGCCAT TCACATTTCC ACGATACACT CGOCCAGTCA GACG ATCTCA 1800 
TTCACCACCA TCATGACTAC CATCATATTC TCCATCATCA CCAOCACCAA AACCACCATC 1860 
CTCACAOTCA CAGCCAGCGC TACTCTCGGG AGGAGCTGAA AGATGCOGGC GTCGCCACTT 1920 

65 TGGCCTGG AT GGTGATAATG GGTGATGGCC TGCACAATTT CAOCOATGGC CTAGCAATTO 1980 
GTGCTGCTTT TACTG AAGGC TTATCAAOTG GTTTAAGTAC TTCTGTTGCT GTGTTCTGTC 2040 
ATGAGTTGCC TCATGAATTA GGTGACTTTG CTGTTCTACT AAAG GCTG GC ATG ACCGTTA 2100 
AGCAGGCTGT CCTTTATAAT GCATTGTCAQ CCATGCTOOC OTATCTTGGA ATGGCAACAG 2160 
GAATTTTCAT TGGTCATTAT GCTGAAAATG TTTCTATGTG GATATTTGCA CTTACTGCTG 2220 

70 GCTTATTCAT GTATGTTGCT CTGGTTGATA TGGTACCTG A AATGCTGCAC AAT OATGCT A 2280 
GTG ACCATGG ATGTAGOCGC TGGGOGTATT TCTTTTTACA OAATGCTGGG ATGCTTTTGG 2340 
OTTTTGGAAT TATGTTACTT ATTTCCATAT TTGAACATAA AATCGTGTTT CGTATAAATT 2400 
TCTAGTTAAG GTTTAAATGC TAGAGTAGCT TAAAAAGTTG TCATAGTTTC AGTAGGTCAT 2460 
AfifiOAQATOA OTTTOTATGC TGTACTATGC AGCGTTTAAA QTTAQTGGGT TTTGTGATTT 2520 

75 TTGTATTGAA TATTGCTGTC TGTTACAAAG TCAGTTAAAG GTACGTTTTA ATATT TAAOT 2580 
TATTCTATCT TGGAGATAAA ATCTGTATGT GCAATTCACC GGTATTACCA GTTTATTATG 2640 
TAAACAAGAG ATTTGGCATG ACATGTTCTG TATGTTTCAG GGAAAAATGT CTTTAATGCT 2700 
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1 11 21 31 43 51 
I I I I I I 

MERRRLWGSI QSRYlSviSVW TSPRRLVELA GQSUJCDEAL AlAALELLPR ELFPPLFMAA 60 
FDGRHSQTIX AMVQAWPFTC LPLGVLMKGQ HLHLETFKA V LDGLDVLLAQ EVRPRRWKLQ 120 
VLDLRKNSHQ DFWTVWSGNR ASLYSFPEPE AAQPMTKKRK VDOLSTEAEQ PFIPVEVLVD 180 
LFLKEGACDE LFSYUEKVK RKKNVLRLCC KKLKJFAMPM QDKM1LKMV QLDSIEDLBV 240 
TCTWKLPTLA KFSPYLGQMI KLRRLLLSHI HASSYISPEK EEQYIAQFTS QFLSLQCLQA 300 
LYVDSLFFLR GRLDQLLRHV MNPLETLSTT NCRLSEODVM HLSQSPSVSQ LSVLSLSGVM 360 
LTDVSPEPLQ AIXERASATL QDLVFDECGI TDDQLLAIXP SLSHCSQLTT LSFYGNSISI 420 
SALQSLLQHL IGLSNLTHVL YPVPLESYED IHGTLHLERL AYLHARLREL LCELGRPSMV 480 
WLSANPCPHC GDRTFYDPEP ILCPCFMPN 



SeqIDNO: 111 DMA sequence 

Nncfcfe Acid Accession ft NM_O03815 

Coding sequence: 8-2452 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

CGCTGCCATQ CGGCTGGCGC TGCTCTGGGC CCTGGGGCTC CTGGGCGCGG GCAGCCCTCT 60 
GCCTTCCTGG CCGCTCCCAA ATATAGGTGG CACTGAGGAG CAGCAGGCAG AGTCAGAGAA 120 
GGCCCCGAGO GAGCCCTTGG AGCCCCAGGT CCTTCAGGAC GATCTCCCAA TTAGCCTCAA 180 
AAAGGTGCTT CAGACCAGTC TGCCTGAGCC CCTGAGGATC AAGTTGGAGC TGGACGGTGA 240 
CAGTCATATC CTGGAGCTGC TACAGAATAG GG AGTTGGTC CCAGGOOGOC CAACCCTGGT 300 
GTGGTACCAG CCCGATGGCA CTCGGGTGGT CAGTGAGGGA CACACTTTGG AGAACTGCTG 360 
CTACCAGGGA AGAGTGOOGG GATATGCAGG CTCCTGGGTG TCCATCTGCA CCTGCTCTGG 420 
GCTCAGAGGC TTGGTGGTCC TG ACCCCAGA GAGAAGCTAT ACCCTGGAGC AGGGGOCTGG 480 
GGACCTTCAG GGTOCTCCCA TTATTTOGCG AATCCAAGAT CTCCACCTGC CAGGOCACAC 540 
CTGTGCCCTG AGCTGGCGGG AATCTGTACA CACTCAGACG CCACCAGAGC ACCCCCTGGG 600 
ACAGCGCCAC ATTCGCCGGA GGOGGGATGT GGTAACAG AG AOCAAGACTG TGGAGTTGGT 660 
GATTGTGGCT GATCACTCGG AGGCCCAGAA ATACCGGGAC TTCCAGCACC TGCTAAACCG 720 
CACACTGGAA GTGGCCCTCT TGCTGG ACAC ATTCTTCCGG CCCCTG AATG TACGAGTGGC 780 
ACTAGTGGGC CTGGAGGCCT GGACCCAGCG TGACCTGGTG GAGATCAGCC CAAACCCAGC 840 
TGTCACCCTC GAAAACTTCC TCCACTGGCG CAGGGCACAT TTGCTGOCTC GATTGCCCCA 900 
TGACAGTGCC CAGCTGGTC A CTGGTACTTC ATTCTCTGGG (XTACGGTGG GCATGGCCAT 960 
TCAGAACTCC ATCTGTTCTC CTGACTTCTC AGGAGGTGTG AACATGGACC ACTCCACCAG 1020 
CATCCTGGGA GTCGCCTCCT CCATAGCCCA TGAGTTGGGC CACAGCCTGG GCCTGGACCA 1080 
TGATTTGCCT GGGAATAGCT GCCCCTGTCC AGGTCCAGCC CCAGCCAAGA CCTGCATCAT 1140 
GGAGGCCTCC ACAGACITOC TACCAGGCCT GAACTTCAGC AACTGCAGCC GACGGGCCCT 1200 
GGAGAAAGCC CTCCTGGATG GAATGGGCAG CTGCCTCTTC GAACGGCTGC CTAGCCTACC 1260 
CCCTATGGCT GCTTTCTGCG GAAATATGTT TGTGGAGCCG GGCGAGCAGT GTGACTGTGG 1320 
CTTCCTGGAT GACTGCGTCG ATCCCTGCTG TGATTCTTTG ACCTGCCAGC TGAGGCCAGG 1380 
TGCACAGTGT GCATCTGACG OACCCTGTTG TCAAAATTGC CAGCTGCGCC CGTCTGGCTG 1440 
GCAGTGTCGT CCTACCAGAG GGGATTGTGA CTTGCCTGAA TTCTGCCCAG GAGACAGCTC 1500 
CCAGTGTCCC CCTGATGTCA GCCTAGGGGA TGGCGAGCCC TGCGCTGGCG GGCAAGCTGT 1560 
GTGCATGCAC GGGCGTTGTG CCTCCTATGC CCAGCAGTGC CAGTCACTTT GGGGACCTGG 1620 
AGCCCAGCCC GCTGCGCCAC TTTGCCTCCA GACAGCTAAT ACTOGGGGAA ATGCTTTTGG 1680 
G AGCTGTGGG OGCAACOOCA GTCGCAGTTA TGTGTCCTGC AOCOCTAGAG ATGCCATTTC 1740 
TGGGCAGCTC CAGTGCCAGA CAGGTAGGAC CCAGCCTCTG CTGGGCTCCA TCCGGGATCT 1800 
ACTCTGGGAG ACAATAGATG TQAATGGGAC TGAGCTG AAC TGCAGCTGGG TGCACCTGGA 1860 
CCTGGGCAGT G ATGTG6CCC AGCCCCTCCT GACTCTGOCT GGCACAGCCT GTGGCCCTGG 1920 
CCTGGTGTGT ATAG ACCATC GATGCCAGCG TGTGGATCTC CTGGGGGCAC AGGAATGTCG 1980 
AAGCAAATGC CATGGACATG GGGTCTGTGA CAGCAACAGG CACTGCTACT GTGAGGAGGG 2040 
CTGGGCACCC CCTGACTGCA CCACTCAGCT CAAAGCAACC AGCTCCCTGA CCACAGGGCT 2100 
GCTOCTCAGC CICCKKncr T AI T G GT C CT GGTGATGCIT GGTGOCGGCT ACTGGTACCG 2160 
TGCCCGCCTG CACCAGCGAC TCTGCCAGCT CAAGGGACCC ACCTGCCAGT ACAGGGCAGC 2220 
CCAATCTGGT CCCTCTGAAC GGCCAGGACC TCCGCAGAGG GCCCTGCTGG CACGAGGCAC 2280 
TAAGTCTCAG GGGCCAGOCA AGCCCCCACC CCCAAGGAAG CCACrOCCTG CCGACCCCCA 2340 
GGGCCGGTGC CCATOGGGTG ACCTGCCCGG COCAGGGGCT GGAATCCCGC CCCTAGTGGT 3400 
ACCCTCCAGA CCAGCGCCAC CGCCTCCG AC AGTGTCCTCG CTCTACCT CT GAC CTCTCCO 2460 
GAGGTTCCGC TGCCTCCAAG CCGGACTTAG GGCTTCAAGA GOCGGGCQTG CCCTCTGGAG 2520 
TCCCCTACCA TGACTGAAGG CGCCAGAGAC TGGCGGTGTC TTAAGACTCC GGGCACCGCC 2580 
ACGCGCTGTC AAGCAACACT CTGCGGACCT GCCGGCGTAG TTGCAGCGGG GGCTTGGGG A 2640 
GGGGCTGGGG GTTGGACGOG ATTGAGGAAG GTCCGCACAG CCTGTCTCTG CTCAGTTGCA 2700 
ATAAACGTGA CATCTTGGCA GCGTTAAAAA AAAAAAAAAA 

SeqIDNO: 112 Protein sequence: 
Proteb Accession*: NPJD038062 

1 II 21 31 41 51 
I I I I I I 

MRLALLWALG LLGAGSPLPS WPLPNIGGTB EQQAESEKAP REPLEPQVLQ DDLP1SLXKV 60 
LQTSLPEPLR KLELDGDSH ILELLQNRHL VPGRPTLVWY QPDGTRWSE GHTLENCCYQ 120 
GRVRGYAGSW VS3CTCSGLR GLWLTPERS YTLEQGPGDL QGPPIISRIQ DLHLPGHTCA 180 
LSWRESVHTQ TPPEHPLGQR HIRRRRDWT ETKTVELVTV ADHSEAQKYR DFQHLLNRTL 240 
EVALLLDTFF RPLNVRVALV GLEAWTQRDL VElSPNPAVr LENFLHWRRA HLLPRLPHDS 300 
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10 



60 



AQLVTGTSFS GPTVGMAIQN SICSPDFSGG VNMDHSTSIL GVASS1AHEL GHSLGLDHDL 360 
PGNSCPCPGP APAKTC3MEA STDFLPGLNP SNCSRRALEK ALLDGMGSCL FERLPSLPPM 420 
AAFCGNMFVE PGEQCDCGFL DDCVDPOCDS LTCQLRPGAQ CASDGPCCQN CQLRPSGWQC 480 
RPTRGDCDLP EFCPODSSQC PPDVSLGDGE PCAGGQAVCM HGRCASYAQQ CQSLWGPGAQ 540 
PAAPLCLQTA NTRGNAFGSC GRNPSGSYVS CTPRDAICGQ LQCQTGRTOP LLGSIRDLLW 600 
ETIDVNGTELNCSWVHLDLG SDVAQPLLTL PGTACGPGLV ODHRCQRVD LLGAQECRSK 660 
CHGHGVCDSN RHCYCKEGWA PPDCTTQLKA TSSLTTGLLL SLLVLLVLVM LGAGYWYRAR 720 
LHQRLCQLKG PTCQYRAAQS GPSERPGPPQ RALXARGTKS QGPAKPPPPR KPLPADPQGR 780 
CPSGDLPGPG AGIPPLWPS RPAPPPPTVS SLYL 

Seq ID NO. 1 13 DNA sequence 

Nnckic Acid Accession ft NM0024I6 

Coding sequence: 40-417 (underlined sequences conespond tD start and stop codons) 



IS 1 II 21 31 41 St 

ATCCAATACA GGAGTGACTT GGAACTCCAT TCTATCACTATGAAGAAAAG TGGTGTTCTT 60 
TTCCTCTTGG GCATCATCTT GCTGGTTCTG ATTGGAGTGC AAGGAACCCC AGTAGTGAGA 120 
AAGGGTCGCT GTTOCTGCAT CAGCACCAAC CAAGGGACTA TCCACCT ACA ATCCTTG AAA 180 
20 GAOCTTAAAC AATTTGCCCC AAGO0CTT0C TGCGAGAAAA TTGAAATCAT TGCTACACTG 240 
AAGAATGGAG TTCAAACATG TCTAAACCCA GATTCAGCAG ATGTOAAGGA ACTGATTAAA 300 
AAGTGGGAGA AACAGGTCAG CCAAAAGAAA AAGCAAAAGA ATGGGAAAAA ACATCAAAAA 360 
AAGAAAGTTC TGAAAGTTCG AAAATCTCAA CGTTCTCGTC AAAAGAAGAC TACATAAGAG 420 
ACCACTTCAC CAATAAGTAT TCTGTGTTAA AAATGTTCTA TTTTAATTAT ACCGCTATCA 480 
25 TTCCAAAGGA GGATGGCATA TAATACAAAG GCTTATTAAT TTGACTAGAA AATTTAAAAC 540 
ATTACTCTGA AATTGTAACT AAAGTTAGAA AGTTGATTTT AAGAATCCAA ACGTTAAGAA 600 
TTGTTAAAGG CTATGATTGT C1I1UHC11 CTACCACCCA CCAGTTGAAT TTCATCATGC 660 
TTAAGGCCAT GATTTTAGCA ATACCCATGT CTACACAGAT GTICAOCCAA CCACATCCCA 720 
CTCACAACAG CTGCCTGGAA GAGCAGCCCT AGGCTTCCAC GTACTGCAGC CTCCAGAGAG 780 
30 TATCTGAGGC ACATGTCAGC AAGTCCTAAG CCTGTTAGCA TGCTGGTGAG CCAAGCAGTT 840 
TGAAATTGAG CTGGACCTCA CCAAGCTGCT GTGGCCATCA ACXTCTGTAT TTGAATCAGC 900 
CTACAGGCCT CACACACAAT GTGTCTGAGA GATTCATGCT GATTGTTATT GGGTATCACC 960 
ACTGGAGATC ACCAGTGTGT GGCTTTCAGA GCCTCCTITC TGGCTTTGGA AGCCATGTGA 1020 
_ _ TTCCATCTTG CCCGCTCAGG CTGACCACTT TATTTCTTTT TGTrCCCCTT TGCTTCATTC 1080 
35 AAGTCAGCTC TTCTOCATCC TACCACAATG CAGTGCCTTT CTTCTCTOCA GTGCACCTGT 1140 
CATATGCTCT GATTTATCTG AGTCAACTCC TTTCTCATCT TGTCCCCAAC ACCCCACAGA 1200 
AGTGCrrrCT TCTCOCAATT CATCCTCACT CAGTOCAGCT TAGTTCAAGT CCTGCCTCTT 1260 
AAATAAACCT TTTTGGACAC ACAAATTATC TTAAAACTCC TCnTCACTT GGTTCAGTAC 1320 
Ark CACATGGGTG AACACTCAAT GGTTAACTAA TTCTTGGGTG TTTATCCTAT CTCTCCAACC 1380 
40 AGATTQTCAG CTCCTTGAGG GCAAGAGCCA CAGTATATTT OOCTGTTTCT TCCACAGTGC 1440 
CTAATAATAC TGTGGAACTA GGTTTTAATA ATTTTTTAAT TGATGTTGTT ATGGGCAGGA 1500 
TGGCAACCAO ACCATTGTCT CAGAGCAGGT GCTGGCTCTT TCCTGGCTAC TCCATGTTGG 1560 
CTAGCCTCTG GTAACCTCTT ACTTATTATC TTCAGGACAC TCACTACAGG GACCAGGGAT 1620 
Ar . G ATGCAACAT CC 1 101C1 J 1 TTATGACAGG Alt/1 IIGCIC AGCTTCTCCA ACAATAAGAA 1680 
45 GCACGTGGTA AAACACTTGC GGATATTCTG GACTGTTTTT AAAAAATATA CAGTTTACOG 1740 
AAAATCATAT AATCTTACAA TGAAAAGGAC TTTATAGATC AGCCAGTGAC CAACCTTTTC 1800 
CCAACCATAC AAAAATTCCT TTTCCCGAAG G AAAAGGGCT TTCTCAATAA GCCTCAGCTT 1860 
TCTAAGATCT AACAAGATAG CCACCGAGAT CCTTATCGAA ACTCATTTTA GGCAAATATG 1920 
AGTTTTATTG TCCGTTTACTTGTTTCAGAG TTTGTATTGT GATTATCAAT TACCACACCA 1980 
50 TCTCCCATGA AGAAAGGGAA CGGTGAAGTA CTAAGCGCTA GAGGAAGCAG CCAAGTCGGT 2040 
TAGTGGAAGC ATGATTGGTG CCCAGTTAGC CTCTGCAGGA TGTGGAAACC TCCTTCCAGG 2100 
GGAGGTTCAG TGAATTGTGT AGGAG AGGTT GTCTGTGGCC AGAATTTAAA CCTATACTCA 2160 
CTTTOCCAAA TTGAATCACT GCTCACACTG CTGATGATTT AGAGTGCTGT CCGGTGGAGA 2220 
TCCCACOCG A ACGTCTTATC TAATCATGAA ACTCCCTAGT TCCTTCATGT AACTTCCCTG 2280 
55 AAAAATCTAA GTGTTTCATA AATTTGAGAG TCTGTGACCC ACTTAOCTTG CATCTCACAG 2340 
GTAGACAGTA TATAACTAAC AACCAAAGAC TACATATTGT CACTGACACA CACGTTATAA 2400 
TCATTTATCA TATATATACA TACATGCATA CACTCTCAAA GCAAATAATT TTTCACTTCA 2460 
AAACAGTATT G ACTTGTATA CCrrGTAATT TGAAATATTT TCTTTGTTAA AATAG AATGG 2520 
TATCAATAAA TAGACCATTA ATCAG 



SeqIDNO: 1 14 Protein sequence: 
Protein Accession fh NP_002407 



I 11 21 31 41 51 

65 i I I l I l 

MKKSGVLFLL GHLLVLIGV QGTPWRKGR CSC1STNQGT IHLQSLXDLK QFAPSPSCEK 60 
IEIIATLKNG VOTCLNPDSA DVKEUKKWE KQVSQKKKQK NGXKHQKKKV LKVRKSQRSR 120 . 
QKKTT 

70 Seq ID NO: 1 15 DNA sequence 

Nucleic Acid Accession #: NMJD03238.1 

Coding sequence: 1 82-1426 (underlined sequences c ui ie &poud to start and stop codons) 
„ 1 II 21 31 41 51 

75 | | i || | 

CAAGCAGGAT ACGTTTTTCT GTTGGGCATT GACTAGATTG TTTGCAAAAG TTTCGCATCA 60 
AAAACAAACA ACAACAACAA AAAACCAAAC AACTCTCCTT GATCTATACT TTGAGAATTG 120 
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TTGATTTCTT 'I Tl TIT J ATT CTGACTTTTA AAAACAACTT 1 1 MT1CCAC TTTTTTAAAA 180 
AATGCACTAC TOTGTGCTOA GCOCrTTTCT GATCCTGCAT CTGGTCACGG TCGCOCTCAO 240 
CCTGTCTACC TGCAGCACAC TCGATATGGA CCAGTTCATG CGCAAGAGGA TCGAGGCGAT 300 
_ CCGCGGGCAG ATCCTGAGCA AGCTGAAGCT CACCAGTCCC CCAGAAGACT ATCCTGAGCC 360 
5 CG AGGAAGTC CCCCCGGAGG TGATTTCCAT CTACAACAGC ACCAGGGACT TGCTCCAGGA 420 
GAAGGCGAGC CGGAGGGCGG CCGCCTGCG A GCGCGAGAGG AGCGACGAAG AGTACTACGC 480 
CAAGG AGGTT T ACAAAATAG ACATGCCGCC CTTCTTCCCC TCOG AAAATG OCATCCOGOC 540 
CACTTTCTAC AGACOCTACT TCAGAATTGT TCG ATTTGAC GTCTCAGCAA TGGAGAAGAA 600 
TGCTTCCAAT TTGGTGAAAG CAOAGTTCAG AGTCTTTCGT TTGCAGAACC CAAAAGCCAQ 660 

10 AGTGCCTGAA CAACGG ATTG AGCTATATCA GATTCTCAAG TCCAAAGATT TAACATCTCC 720 
AACCCAGCGC TACATCOACA GCAAAGTTGT GAAAACAAGA GCAGAAGGCG AATGGCTCTC 780 
CTTOGATGTA ACTGATGCTG TTCATGAATG GCTTCACCAT AAAGACAGGA ACCTGGGATT 840 
TAAAATAAGC TTACACTGTC CCTGCTGCAC TTTTGTACXA TCTAATAATT ACATCATCCC 900 
AAATAAAAGT GAAGAACTAG AAGCAAGATT TGCAGGTATT GATGGCACCT CCACATATAC 960 

15 CAGTGGTGAT CAGAAAACT A TAAAGTCCAC TAGGAAAAAA AACAGTGGG A AGACCCCACA 1020 
TCTCCTGCTA ATGTTATTGC (XTCCTACAG ACTTGAGTCA CAACAGACCA ACCGGCGG AA 1080 
GAAGCGTGCT TTGGATOCGG CCTATTGCTT TAG AAATGTG CAGGATAATT GCTGCCTACG U40 
TCCACTTTAC ATTGATTTCA AGAGGGATCT AGGGTGGAAA TGGATACACG AACCCAAAGG 1200 
GTACAATGCC AACTTCTGTG CTGGAGCATG CCCGTATTTA TGGAGTTCAG ACACTCAGCA 1260 

20 CAGCAGGGTC CTG AGCTTAT ATAATACCAT AAATCCAG AA GCATCTGCTT CTOCTTGCTG 1320 
GGTGTCCCAA GATTTAGAAC CTCTAACCAT TCTCTACTAC ATTGGCAAAA CACCCAAGAT 1380 
TGAACAGCTT TCTAATATGA TTGTAAAGTC TTGCAAATGC AGCTAAAATT CTTGGAAAAG 1440 
TGGCAAGACC AAAATGACAA TGATGATGAT AATGATGATG ACGACGACAA CGATGATGCT 1500 
TGTAACAAG A AAACATAAGA GAGCCTTGGT TCATCAGTGT TAAAAAATTT TTGAAAAGGC 1560 

25 GGTACTAGTT CAGACACTTT GGAAGTTTGT Gl lCI UUi U TTAAAACTGG CATCTGACAC 1620 
AAAAAAAGTT G AAGGCCTTA TTCTACATTT CACCTACTTT GTAAGTGAGA GAGACAAGAA 1680 
GCAAATTTTT TTAAA 

ScqIDNO: 116 Protein sequence: 
30 Protein Accession ft NP_003229.I 

1 11 21 31 41 51 
I I I I I I 

MHYCVLSAFL ILHLVTVALS LSTCSTLDMD QFMRKRIEAI RGQJLSKLKL TSPPEDYPEP 60 
35 EEVPPEVISI YNSTRDLLQB KASRRAAACB RERSDEEYYA KEVYKIDMPP FFPSENAIPP 120 
TFYRPYFRIV RPDVSAMEKN ASNLVKAEFR VFRLQNPKAR VPEQRIELYQ ILKSKDLTSP ISO 
TQRYIDSKW KTRAEGEWLS FDVTDAVHEW LHHKDRNLGF KISLHCPCCT FVPSNNYIIP 240 
NKSEELHARF AGIDGTSTYT SGDQKTIKST RKKNSGKTPH LLLMLLPSYR LESQQTNRRK 300 
KRALDAAYCF RNVQDNCCLR PLYIDFKRDL GWKWIHEPKG YNANFCAG AC PYLWSSDTQH 360 
40 SRVLSLYNTI NPEASASPCC VSQDLEPLTT LYYIGKTPKI BQLSNMTVKS CKCS 

ScqIDNO: 117 DNA sequence 

Nucleic Acid Accession #: NM 000095.1 

Coding sequence: 26-2299 (tiDderiiDed scquesces correspond to start and stop codans) 

45 

I 11 21 31 41 51 
1 I I I I I 

CAGCACCCAG CTCCCCGCCA CCGCCATOGT CCCCGACACC GCCTGCGTTC TTCTGCTCAC 60 
CCTGGCTGOC CTCGGCGCGT CCGGACAGGG CCAGAGOCCQ TTGGGCTCAG ACCTGGGOCC 120 

50 GCAGATGCTT CGGGAACTGC AGO AAACCAA CGCGGCGCTG CAGGACGTGC GGGACTGGCT 180 
GCGGCAGCAG GTCAGGGAGA TCACOTTCCT GAAAAACACG GTGATGOAGT GTGACGCGTG 240 
CGGGATGCAG CAGTCAGTAC GCACCGGCCT AOCCAGCGTG CGGOOCCTGC TCCACTGCGC 300 
GCCCGGCTTC TGCTTCCCCG GCGTGGOCTG CATCCAG ACG GAGAGCGGCG GCCGCTGCGG 360 

„ CCCCTGCXX3C GCGGGCTTCA CGGGCAACGG CTCGCACTGC ACCGACGTCA ACGAGTGCAA 420 

55 CttCCCACCCC TGCTTCCCCC G AGTCOGCTG TATCAACACC AGCCCGGGGT TCCGCTGCGA 480 
GGCTTGCCCG COGGGGTACA GCGGCCCCAC CCACCAGGGC GTGGGGCTGG CTTTCGCCAA 540 
GGCCAACAAG CAGGTTTGCA CGGACATCAA CGAGTGTGAG ACCGGGCAAC ATAACTGCGT 600 
CCOCAACKX:GTGnrGCATCAACAOCCOGGGCrC^ 660 
CTTOGTGGGC GACCAGGCGT CCGGCTGCCA GCGCGGCGCA CAGCGCTTCT GCCCCGACGG 720 

60 CTCGCCCAGC GAGTGCXACG AGCATGCAGA CTGCGTCCTA GAGCGCGATG GCTCGCGGTC 780 
GTGCGTGTGT CGCGTTGGCT GGGCCGGCAA CGGGATCCTC TGTGGTOGCG ACACTG ACCT 840 
AG ACGGCTTC CCGGACGAGA AGCTGCGCTG CCCGGAGCCG CAGTGCCGTA AGGACAACTG 900 
CGTG ACTGTG CCCAACTCAG GGCAGG AGG A TGTGG ACCGC G ATGGCATOG GAG ACGCCTG 960 
CGATCCGGAT GCCGACGGGG ACGGGGTOCC CAATGAAAAG GACAACTGCC CGCTGGTGCG 1020 

65 GAACOCAGAC CAGCGCAACA CGGACGAGGA CAAGTGGGGC GATGCGTGCG ACAACTGCOG 1080 
GTCOCAGAAG AACGACGACC AAAAGGACAC AGACCAGGAC GGCCGGGGCG ATGCGTGCGA 1140 
CGAOGACATC GACGGCGACC GGATCCGCAA CCAGGCCGAC AACTGCOCTA GGGTACCCAA 1200 
CTCAGACCAG AAGGACAGTG ATGOCQATGG TATAGGGGAT GCCTOTGACA ACTGTCCCCA 1260 
G AAG AGCAAC CCGOATCAGG CGGATGTGGA CCACGACTTT GTGGGAGATG CTTGTGACAG 1320 

70 CGATCAAGAC CAGGATOGAG ACGGACATCA GGACTCTCGG GACAACTGTC CCACGGTGCC 1380 
TAACAGTGCC CAGGAGGACT CAG ACCACGA TGGCCAGGGT GATGCCTGCG ACGACGACGA 1440 
CGACAATG AC GGAGTCCCTG ACAGTCGGGA CAACTGCCGC CTGGTGCCTA ACCCCGGCCA 1500 
GGAGGACGCG GACAGGGACG GCGTGGGOGA CGTGTGCCAG GACQACTTTO ATGCAGACAA 1560 
GGTGGTAGAC AAGATCGACG TGTGTCCGGA GAACGCTGAA GTCACGCTCA CCGACTTCAG 1620 

75 GGCCTTCCAG ACAGTCOTGC TGGACCCGG A GGGTGACGCG CAGATTGACC CCAACTGGGT 1680 
GGTGCTCAAC CAGGGAAGGG AGATCGTGCA GACAATGAAC AGCGACCCAG GCCTGGCTGT 1740 
GGGTTACACT GCCTTCAATG GCGTGGACTT CGAGGGCACG TTCCATGTGA ACACGGTCAC 1800 
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GGATGACGAC TATGCGGGCT TCATCTTTGG CTACCAOGAC AGCTCCAGCT TCTACGTGOT I860 
CATGTGG AAG CAG ATGGAGC AAACOTATTG GCAGGCGAAC CCCTTCOGTO CTGTGGCCGA 1920 
GCCTGGCATC CAACTCAAGG CTGTGAAGTC TTCCACAGGC CCCGGGGAAC AGCTGCGGAA 1980 
CGCTCTGTGG CATACAGGAG ACACAGAGTC CCAGGTGCGG CTGCTGTGG A AGGACCCGCG 2040 
AAACGTGGGT TGGAAGGACA AGAAGTCCTA TCQTTGGTTC CTGCAGCACC GGCCCCAAGT 2100 
GGGCTACATC AGGGTGCGAT TCTATGAGGG COCTGAGCTG GTGGCCGACA GCAACGTGGT 2160 
CTTGGACACA ACCATGCGGG GTGGCCGCCT GGGGGTCTTC TGCTTCTCCC AGGAGAACAT 2220 
CATCTGGGCC AACCTGCGTT ACCGCTGCAA TGACACCATC CCAGAGGACT ATGAGACCCA 2280 
TCAGCTGCGG CAAGOCTAGG GACCAGGGTG AGGAGCCGOC GGATGACAGC CACCCTCACC 2340 
GCGGCTGGAT GGGGGCTCTG CACCCAGCCC AAGGGGTGGC CGTCCTGAGG GGGAAGTGAG 2400 
AAGGGCTCAG AGAGGACAAA ATAAAGTGTG TGTGCAGGG 

SeqIDNO: 118 Protein seontnce: 
Protein Accession *: NP_000086.1 

1 11 21 31 41 51 
I I I I I I 

MVPDTACV1X LTLAALGASG QGQSPLGSDL GPQMLRELQB TNAALQDVRD WLRQQVREIT 60 
FLKNTVMBCD ACGMQQSVRT GLPSVRPLLH CAPGFCFPGV ACIQTESGGR CGPCPAGFTG 120 
NGSHCTDVNE CNAHPCFPRV RCINTSPGFR CEACPPGYSG PTHQGVGLAF AKANKQVCTD ISO 
INECETGQHN CVPNSVCINT RGSFQCGPOQ PGFVGDQASG CQRGAQRFCP DGSPSBCHEH 240 
ADCVLERDGS RSCVCRVGWA GNGILCGRDT DLDGFPDEKL RCPEPOCRKD NCVTVPNSGQ 300 
EDVDRDGIGD ACDPDADGDG VPNEKDNCPL VRNPDQRNTD EDKWGD ACDN CRSQKNDDOK 360 
DTDQDGRGDA CDDDIDGDR1 RNQADNCPRV PNSDQKDSDG DG1GDACDNC PQKSNPDQAD 420 
VDHDFVGDAC DSDQDQDGDG HQDSRDNCPT VPNSAQEDSD HDGQGDACDD DDDNDGVPDS 480 
RDNCRLVPNP GQEDADRDGV GDVCQDDFDA DKWDKIDVC PENAEVTLTD FRAFQTWLD 540 
PEGDAQJDPN WWLNQGREI VQTMNSDPGL AVGYTAFNGV DFEGTFHVNT VTDDDYAGFI 600 
FGYQDSSSFY VWIWKQMEQT YWQANPFRAV AEPGIQLKAV KSSTCPGEQL RNALWHTGDT 660 
ESQVRLLWKD PRNVGWKDKK SYRWFLQHRP QVGYIRVRFY EGPELV ADSN WLDTTMRGQ 720 
RLGVFCFSQE NHWANLRYR CNDTIPEDYB IHQLRQA 

SeqIDNO: 119 DNA sequence 

Nucleic Acid Accession ft NMJD1 421 1 

Coding sequence: 157-1479 (underlined sequences correspond bo start and stop codons) 

1 (I 21 31 41 51 
I I I I I I 

GGGACAGGGC TGAGGATGAG GAGAACCCTG GGGACCCAGA AGACCGTGCC TTGCCCGGAA 60 
GTCCTGCXTTG TAGGCCTGAA GGACTTGCCC TAACAGAGCC TCAACAACTA CCTGGTGATT 120 
CCTACTTCAG CCCCTTGCTG TGAGCAGCTT CTCAA CATGA ACTACAGCCT CCACTTGGCC 180 
TTCGTGTGTC TGAGTCTCTT CACTGAGAGG ATGTGCATCC AGGGGAGTCA GTTCAACGTC 240 
GAGGTCGGCA GAAGTGACAA GCTTTCCCrG CCTGGCTTTG AGAACCTCAC AGCAGGATAT 300 
AACAAATTTC TCAGGCCCAA TTTTGGTGGA GAACCCGTAC AGATAGCGCT GACTCTGGAC 360 
ATTGCAAGTA TCTCTAGCAT TTCAGAGAGT AACATGGACT ACACAGCCAC CATATACCTC 420 
CGACAGCGCT GGATGGACCA GCGGCTGGTO TTTGAAGGCA ACAAGAGCTT CACTCTGGAT 480 
GCCCGCCTCG TGGAGTTCCT CTGGGTGCCA GATACTTACA TTGTGGAGTC CAAGAAGTCC 540 
TTCC7XCATG AAGTCACTGT GGGAAACAGG C7XATCCOCC TCTTC7XCAA TGGCACGGTC 600 
CTGTATGCCC TCAGAATCAC GACAACTGTT GCATGTAACA TGGATCTGTC TAAATACCCC 660 
ATGGACACAC AGACATGCAA GTTGCAGCTG GAAAGCTGGG GCTATGATGG AAATGATGTG 720 
GAGTTCACCT GGCTGAGAGG GAACGACTCT GTGCGTGGAC TGGAACACCT GCGGCTTGCT 780 
CAGTACACCA TAG AGCGGTA TTTCACCTTA GTCACCAGAT CGCAGCAGGA GACAGGAAAT 840 
TACACTAGAT TOGTCTTACA Gill GA GCTT CGGAGGAAT G TTCTGTATT T CATTTTGGAA 900 
ACCTACGTTC CTTCCACTTT CCTGGTGGTG 7TGTCCTGGG TTTCATTTTG GATCTCTCTC 960 
GATTCAGTCC CTGCAAGAAC CTGCATTGGA GTGACG ACCG TGTTATCAAT GACCACACTG 1020 
ATG ATCGGGT CCCGCACTTC TCTTCCCAAC ACCAACTGCT TCATCAAGGC CATCGATGTG 1080 
TACCTGGGGA IC1GC1 1 1 A G CI I I J GGGGOCTTGC TAOAATATOC AQTTOCICAC 1140 
TACAGTTCCT TACAGCAG AT GGCAGCCAAA GATAGGGGGA CAACAAAGGA AGTAG AAGAA 1200 
GTCAGTATTA CTAATATCAT CAACAGCTCC ATCTCCAGCT TTAAAOGGAA GATCAGCTTT 1260 
GCCAGCATTG AAATTTCCAG CGACAACGTT GACTACAGTG ACTTGACAAT GAAAACCAGC 1320 
G ACAAGTTCA AGTTTGTCTT CCGAGAAAAG ATGGGCAGGA TTGTTGATTA TTTCACAATT 1380 
CAAAACCCCA GTAATGTTGA TCACTATTCC AAACTACTGT TTCCTTTGAT TTTTATGCTA 1440 
GCCAATGTAT TTTACTGGGC ATACTACATG TATITI 10 AG TCAATGTTAA ATTTCTTGCA 1500 
TGCCATAGGT CTTCAACAGG ACAAGATAAT GATGTAAATG GTATTTTAGG OCAAGTGTGC 1560 
ACCCACATCC AATGGTGCTA CAAGTGACTG AAATAATATTTGAGTCTTTC TGCTCAAAGA 1620 
ATG AAGCTCC AACCATTGTT CTAAGCTGTG TAG AAGTCCT AGCATTATAG GATCTTGTAA 1680 
TAG AAACATC AGTCCATTCC TCTTTCATCT TAATCAAGG A CATTCCCATG GAGCCCAAGA 1740 
TTACAAATGT ACTCAGGGCT GTTTATTOGG TGGCTOOCTG GTTTGCATTT ACCICATATA 1800 
AAGAATGGGA AGGAGACCAT TGGGTAACCC TCAAGTGTCA GAAGTTGTTT CTAAAGTAAC I860 
TATACATGTT TTTTACTAAA TCTCTGCAGT GCTTATAAAA TACATTGTTG CCTATTTAGG 1920 
GAOTAACATT TTCTA U I HI 1 U 1 1 1 C 1 GGT TAAAATO AAA T ATOGGCTT A TOTCAATTCA 1980 
TTGGAAGTCA ATGCACTAAC TCAATACCAA GATGAGTTTT TAAATAATGA ATATTATTTA 2040 
ATACCACAAC AG AATTATCC CCAATTTCCA ATAAGTCCTA TCATTGAAAA TTCAAATATA 2100 
AGTGAAGAAA AAATTAGTAG ATCAACAATC TAAACAAATC CCTCGGTTCT AAGATACAAT 2160 
GGATTCCCCA TACTGGAAGG ACTCTGAGGC TrTATTCCCC CACTATGCAT ATCTTATCAT 2220 
TTTATTATTA TACACACATC CATCCTAAAC TATAC TAAAG COCITTTCCC ATGCATGGAT 2280 
GGAAATGGAA GATTTTTTTG TAACTTGTTC TAGAAGTCTT AATATGGGCT GTTGCCATGA 2340 
AGGCTTGCAG AATTOAGTCC ATTTTCTAGC TGCCTTTATT CACATAGTGA TGGGGTACTA 2400 
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AAAGTACTGG OTTGACTCAO AGAGTCGCTO TCATTCTGTC ATTGCTGCTA CTCTAACACT 2460 
GAGCAACACT CTCCCAGTGG CAGATCCCCT GTATCATTCC AAGAGGAGCA TTCATCOCTT 2520 
TGCTCTAATG ATCAGGAATG ATGCTTATTA GAAAACAAAC TGCTTGACCC AGGAACAAOT 2580 
QGCTTAGCTT AAGTAAACTT GGCTTTGCTC AGATCCCTGA TCCTTCCAGC TGGTCTGCTC 2640 
TGAGTGGCTT ATCCCGCATG AGCAGGAGCG TGCTGGCCCT GAGTACTGAA CTTTCTGAGT 2700 
AACAATGAGA CACGTTACAG AACCTATGTT CAGGTTGCGG GTGAGCTGCC CTCTCCAAAT 2760 
CCAGCCAGAG ATGCACATTC CTCGGCCAGT CTCAGCCAAC AGTACCAAAA GTGATTTTTG 2820 
AGTGTGCCAG GGTAAAGOCT TCCAGTTCAG CCTCAGTTAT TTTAGACAAT CTCGCCATCT 2880 
TTAATTTCTT AGCTTOCTGT TCTAATAAAT GCACGGCTTT ACCTTTCCTG TCAGAAATAA 2940 
ACCAAGGCTC TAAAAGATGA TTTCOCTrCT GTAACTCCCT AGAGCCACAG GTTCTCATTC 3000 
CnilC CCAT TATACTTCTC ACAATTCAGT TTCTATGAGT TTGATCACCT GATTTTTTTA 3060 
ACAAAATATT TCTAACGGG A ATGGGTGGGA GTGCTGOTGA AAAGAG ATGA AATGTGGTTG 3120 
TATGAGCCAA TCATATTTGT G AlllHiA A AAAAAGTTTA AAAGGAAATA TCTGTTCTGA 3180 
AACCCCACTT AAGCATTGTT TTTATATAAA AACAATGATA AAGATGTGAA CTGTGAAATA 3240 
AATATACCAT ATTAGCTACC CACC 

ScqIDNO: 120 Protein sequence: 
Piutem Accession ff: NPJD55026.1 

I 11 21 31 41 51 
) I I II I 

MNYSLHLAFV CLSLFTERMC IQGSQFNVEV GRSDKLSLPG FENLTAGYNK FLRPNFGGEP 60 
VQiALTLDIA S1SSISESNM DYT ATIYLRQ RWMDQRL VFE GNKSFTLDAR LVEFLWVPDT 120 
YTVESKKSFL HEVTVGNRLI RLFSNGTVLY AUUTTTVAC NMDLSKYPMD TQTCKLQLES 180 
WGYDGNDVEF TWLRGNDSVR GLEHLRLAQY TTERYFTLVT RSQQETGNYT RLVLQFELRR 240 
NVLYF1LETY VPSTFLWLS WVSFW1SLDS VPARTCIGVT T VLSMTT LM1 GSKTSLPNTN 300 
CFUCAIDVYL GICFSFVFGA LLEYAVAHYS SLQQMAAKDR GTTKEVEHVS ITNIINSSIS 360 
SFKRKISFAS IEISSDNVDY SDLTMKTSDK FKFVFREKMG RIVDYFTIQN PSNVDHYSKL 420 
LFPLEFMLAN VFYWAYYMYF 

SeqIDNO: 121 DNA sequence 

Nucleic Acid Accession #: NM 001854 

Coding sequence: 163-5582 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

AACCATCAAA TTTAG AAGAA AAAGCCCTTT GACTITI ICC CCCTCTCCCT CCCCAATGGC 60 
TGTGTAGCAA ACATCCCTGG CXiATACCTTG GAAAGGACGA AGTTGGTCTG CAGTCGCAAT 120 
TTCGTGGGTT GAGTTCACAG TTGTGAGTGC GGGGCTCGGA GATGGAGCCG TGGTCCTCTA 180 
GGTGOAAAAC OAAAOGGTGO CTCTGGG ATT TCAOOGTAAC AACOCTOGCA TTG ACCTTCC 240 
TCTTCCAAGC TAGAGAGGTC AGAGGAGCTG CTCCAGTTGA TGTACTAAAA GCACTAGATT 300 
TTCACAATTC TCCAGAGGG A ATATCAAAAA CAACGGGATT TTGCACAAAC AGAAAGAATT 360 
CTAAAGGCTC AG ATACTGCT TACAGAGTTT CAAAGCAAGC ACAACTCAGT GCCCCAACAA 420 
AACAGTTATT TCCAGGTGGA ACTTTCCCAG AAGACTTTTC AATACTATTT ACAGTAAAAC 480 
CAAAAAAAGG AATTCAGTCT TTCCTTTTAT CTATATATAA TG AGCATGGT ATTCAGCAAA 540 
TTGGTGTTGA GGTTGGG AGA TCACCTGTTT TTCTGTTTGA AGACCACACT GGAAAACCTG 600 
CCCCAOAAGA CTATCCOCTC TTCAGAACTG TTAACATCGC TGACGGGAAG TOGCATCGGG 660 
TAOCAATCAG CGTCGAGAAQ AAAACTGTGA CAATGATTGT TOATTGT AAG AAG AAAACCA 720 
CGAAACCACT TGATAGAAGT GAGAGAGCAA TTGTTGATAC CAATGGAATC ACGGTTnTG 780 
GAACAAGGAT TTTGGATGAA GAAGTTTTTG AGGGGGACAT TCAGCAGTTT TTC ATOACAG 840 
GTGATCCCAA GGCAGCATAT GACTACTGTG AGCATTATAG TCCAGACTGT GACTCTTCAG 900 
CACCCAAGGC TGCTCAAGCT CAGGAACCTC AGATAGATGA GTATGCACCA GAGGATATAA 960 
TOG AATATQA CTATOAOTAT GGGGAAOCAO AGTATAAAGA GGCTO AAAOT GTAA CAGAGG 1020 
GAOCCACTGT AACTGAGGAG ACAATAGCAC AGACXKaAGGC AAACATCGTT GATGATTTTC 1080 
AAG AATACAA CTATGGAACA ATGGAAAGTT ACCAGACAGA AGCTCCTAGG CATGTTTCTG 1140 
GGACAAATG A GCCAAATCCA GTTGAAGAAA TATTTACTGA AGAATATCTA ACGGGAGAGG 1200 
ATTATGATTC CCAGAGG AAA AATTCTG AGG ATACACTATA TGAAAACAAA G AAATAGACG 1260 
GCAGGGATTC TGATCTTCTG GTAGATGGAG ATTTAGGCGA ATATGATTTT TATGAATATA 1320 
AAGAATATGA AGATAAACCA ACAAGCCCCC CTAATGAAG A ATTTGGTCCA GGTGTACCAG 1380 
CAGAAACTQA TATTACAGAA ACAAGCATAA ATOGCCATGO TGCATATGGA GAOAAAGGAC 1440 
AGAAAGGAGA ACCAGCAGTG GTTGAGCCTG GTATGCTTGT CGAAGGACCA CCAGGACCAG 1500 
CAGGACCTGC AGGTATTATO GGTCCTCCAG GTCTACAAGG CCCCACTGGA CCCCCTGGTG 1560 
ACCCTGGCOA TAGGGGCCCC CCAGGACGTC CTGGCTTACC AGGGGCTGAT GGTCTACCTG 1620 
GTCCTCCTGG TACTATGTTG ATGTTACCGT TCCGTTATGG TGGTGATGGT TCCAAAGGAC 1680 
CAACCATCTC TGCTCAGGAA GCTCAGGCTC AAGCTATTCT TCAGCAGGCT CGGATTGCTC 1740 
TGAQ AGGCCC ACCTGGCCCA ATGGGTCTAA CTOGAAGACC AGGTCCTGTG OGGOOOOCTO 1800 
GTTCATCTGG GGCXAAAGGT GAGAGTGGTG ATCCAGGTCC TCAGGGCCCT CGAGGCGTCC 1860 
AGGGTCCCCC TGGTCCAACG GGAAAACCTG GAAAAAGGGG TCGTCCAGGT GCAGATGGAG 1920 
GAAGAGGAAT GCCAGGAGAA CCTGGGGCAA AGGGAGATCG AGGGTTTGAT GGACTTCCGG 1980 
GTCTGCCAGG TG ACAAAGGT CACAGGGGTQ AACGAGGTCC TCAAGGTCCT CCAGGTCCTC 2040 
CTGGTGATGA TGGAATGAGG GOAGAAGATG GAGAAATTGG ACCAAGAGGT CTTCCAGGTG 2100 
AAGCTGGCCC ACGAGGTTTG CK5GGTCCAA GGGGAACTCC AGGAGCTCCA GGGCAGCCTG 2160 
GTATGGCAGG TGTAGATGGC CCCCCAGGAC CAAAAGGGAA CATGGGTCCC CAAGGGGAGC 2220 
CTOGGCCTCC AGGTCAACAA GGGAATCCAG GACCTCAGGG TCTTCCTGGT CCACAAGGTC 2280 
CAATTGGTCC TCCTGGTGAA AAAGGACCAC AAGGAAAACC AGGACTTGCT GGACTTCCTG 2340 
GTGCTGATGO GCCTCCTGGT CATCCTGGGA AAGAAGGCCA GTCTGGAGAA AAGGGGGCTC 2400 
TGGGTCCCCC TOGTCCACAA GGTCCTATTG GATNNCCGGG CCCCCGGGGA GTAAAGGGAG 2460 
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CAGATGGTGT CAGAGGTCTC AAGGG ATCTA AAGGTG AAAA GGGTOAAGAT GGTTTTCCAG 2520 
GATTCAAAGG TGACATGGGT CTAAAAGGTG ACAGAGGAGA AGTTGGTCAA ATTGGCCCAA 2580 
GAGGGNAAGA TGGCCCTGAA GGACCCAAAG GTCGAGCAGG OCCAACTGGA GACCCAGGTC 2640 
CTTCAGGTCA AGCAGGAGAA AAGGGAAAAC TTGGAGTTCC AGGATTACCA GGATATCCAG 2700 
5 GAAGACAAGG TCCAAAGGGT TCCACTGGAT TCCCTGGGTT TCCAGGTGCC AATGGAGAGA 2760 
AAGGTGCACG GGGAGTAGCT OGCAAACCAG GCCCTCGGGG TCAGCGTGGT CCAACGGGTC 2820 
CTCGAGGTTC AAGAGGTGCA AGAGGTCCCA CTGGGAAACC TGGGCCAAAG GGCACTTCAG 2880 
GTGGCGATGG CCCTCCTGGC CCTCCAGGTG AAAGAGGTCC TCAAGGACCT CAGGGTCCAG 2940 
TTGGATTOOC TGGACCAAAA GGCCCICCTG GACCACCAGG AAGGATGGGC TGOCCAGGAC 3000 

10 ACCCTOGGCA ACGTGGGG AG ACTGOATTTC AAGGCAAGAC CGGCXXTCCT GGGCCAGGGG 3060 
GAGTGGTTGG ACCACAGGGA CCAACOGGTG AGACTGGTCC AATAGGGGAA CGTGGGTATC 3120 
CTGGTCCTCC TGGOCCTCCT GGTGAGCAAG GTCTTOCTGG TGCTGCAGGA AAAGAAGGTG 3180 
CAAAGGGTGA TCCAGGTCCT CAAGGTATCT CAGGGAAAGA TGGACCAGCA GGATTACGTG 3240 
GTTTCCCAGG GGAAAGAGGT CTTOCTGGAG CTCAGGGTGC ACCTGGACTG AAAGGAGGGG 3300 

1 5 AAGGTCCCCA GGGCCCACCA GGTCCAGTTG GCTCACCAGG AGAACGTGGG TCAGCAGGTA 3360 
CAGCTGGCCC AATTGGTTTA CGAGGGCGCC CGGGACCTCA GGGTOCTCCT OGTCCAGCTG 3420 
GAGAGAAAGG TGCTCCTGGA GAAAAAGGTC OOCAAGGGCC TGCAGGGAG A GATGGAGTTC 3480 
AAGGTCCTGT TGGTCTCCCA GGGCCAGCTG GTCCTGCCGG CTCCCCTGGG GAAGACGGAG 3540 
ACAAGGGTGA AATTGGTGAG CCGGGACAAA AAGGCAGCAA GGGTGGCAAG GGAGAAAATG 3600 

20 GCCCTCCCGG TCCCCCAGGT CTTCAAGGAC CAGTTGGTGC CCCTGGAATT GCTGGAGGTG 3660 
ATGGTGAACC AGGTOCTAGA GGACAGCAGG GGATGTTTGG GCAAAAAGGT GATGAGGGTG 3720 
OCAGAGGCTT CCCTGGACCT CCTGGTCCAA TAGGTCTTCA GGGTCTGCCA GGCCCACCTG 3780 
GTGAAAAAGG TGAAAATGGG GATGTTGGTC CATGGGGGOC ACCTGGTCCT CCAGGCCCAA 3840 
GAGGCCCTCA AGGTCCCAAT GGAGCTGATG GACCACAAGG ACXXXXAGGT TCTGTTGGTT 3900 

25 CAGTTGGTGG TGTTGGAGAA AAGGGTGAAC CTGGAGAAGC AGGAAACCCA GGGCCTCCTG 3960 
GGGAAGCAGG TGTAGGCGGT CCCAAAGGAG AAAGAGGAGA GAAAGGGGAA GCTGGTCCAC 4020 
CTGGAGCTGC TGGACCTCCA GGTGCCAAGG GGCCGCCAGG TGATGATGGC CCTAAGOGTA 4080 
ACCCGGOTCC TGTTGGTTTT CCTGGAGATC CTGGTCCTCC TGGGGAACTT GGCOCTGCAG 4140 
GTCAAGATGG TGTTGGTGGT GACAAGGGTG AAGATGGAGA TCCTGGTCAA CCGGGTCCTC 4200 

30 CTGGCCCATC TGGTGAGGCT GGCCCACCAG GTCCTCCTGG AAAACOAGGT CCTCCTOGAO 4260 

CTGCAGGTGC AGAGGGAAGA CAAGGTGAAA AAGGTGCTAA GGGGGAAGCA GGTGCAGAAG 4320 
GTCCTCCTGG AAAAACCGGC CCAGTCGGTC CTCAGGGACC TGCAGGAAAG CCTGGTCCAG 4380 
AAGGTCTTCG GGGCATCCCT GGTCCTGTGG GAGAACAAGG TCTCCCTGGA GCTGCAGGCC 4440 
AAGATGGACC ACCTGGTCCT ATGGGACCTC CTGGCTTACC TGGTCTCAAA GGTGACCCTG 4500 

3 5 GCTCCAAGGG TGAAAAGGGA CATCCTGGTT TAATTGGOCT GATTGGTCCT CCAGGAGAAC 4560 
AAGGGGAAAA AGGTGACCGA GGGCTCCCTG GAACTCAAGG ATCTCCAGGA GCAAAAGGGG 4620 
ATGGGGGAAT TCCTGGTCCT GCTGGTCCCT TAGGTCCACC TGGTCCTCCA GGCTTACCAG 4680 
GTCCTCAAGG CCCAAAGGGT AACAAAGGCT CTACTGGACC CGCTGGCCAG AAAGGTGACA 4740 
GTGGTCTTCC AGGGCCTCCT GGGCCTCCAG GTCCACCTGG TGAAGTCATT CAGCCTTTAC 4800 

40 CAATCTTGTC CTCCAAAAAA ACGAGAAGAC ATACTGAAGG CATGCAAGCA GATGCAGATG 4860 
ATAATATTCT TGATTACTCG GATGGAATGG AAGAAATATT TGGTTCCCTC AATTCCCTGA 4920 
AACAAGACAT CGAGCATATG AAATTTCCAA TGGGTACTCA GACCAATCCA GCCCGAACTT 4980 
GTAAAGACCT GCAACTCAGC CATCCTGACT TCCCAGATGG TGAATATTGG ATTGATCCTA 5040 
ACCAAGGTTG CTCAGGAGAT TCCTTCAAAO TTTACTGTAA TTTCACATCT GGTGGTGAGA 5100 

45 CTTGCATTTA TCCAGACAAA AAATCTGAGG GAGTAAG AAT T TCATCA TGG CCAAAGGAGA 5160 
AACCAGGAAG TTGGTTTAGT GAATTTAAGA GGGGAAAACT GCTTTCATAC TTAGATGTTG 5220 
AAGGAAATTC CATCAATATG GTGCAAATGA CATTCCTGAA ACTTCTGACT GOCTCTGCTC 5280 
GGCAAAATTT CACCTACCAC TGTCATCAGT CAGCAGCCTG GTATGATGTG TCATCAGGAA 5340 
GTTATGACAA AGCACTTCGC TTCCTGGGAT CAAATGATGA GGAGATGTCC TATGACAATA 5400 

50 ATCCTTTTAT CAAAACACTG TATGATGGTT GTACGTCCAG AAAAGGCTAT GAAAAAACTG 5460 
TCATTGAAAT CAATACACCA AAAATTGATC AAGTACCTAT TGTTGATGTC ATGATCAGTG 5520 
ACTTTGGTGA TCAGAATCAG AAGTTCGGAT TTGAAGTTGG TCCTGTTTGT 11 1U11GGCT 5580 
AAO ATTAAGA CAAAG AACAT ATCAAATCAA CAGAAAATGT ACCTTGGTGC CACCAACCCA 5640 
TTTTGTGCCA CATGCAAGTT TTGAATAAGG ATGTATGGAA AACAACGCTG CATATACAGG 5700 

55 TACCATTTAG GAAATACCGA TGCCTTTGTG GGGGCAGAAT CACAGACAAA AQCTTTGAAA 5760 
ATCATAAAGA TATAAGTTGG TGTGGCTAAG ATGGAAACAG GGCTGATTCT TGATTCCCAA 5820 
TTCTCAACTC TCCITTTCCT ATTTGAATTT CTTTGGTGCT GTAG AAAACA AAAAAAGAAA 5880 
AATATATATT CATAAAAAAT ATGGTGCTCA TTCTCATCCA TCCAGGATGT ACTAAAACAG 5940 

„ TGTGTTTAAT AAATTGTAAT TATTTTGTGT ACAGTTCTAT ACTGTTATCT GTGTCCATTT 6000 

60 CCAAAACTTG CACGTGTCCC TGAATTCCGC TGACTCTAAT TTATGAGGAT GCCGAACTCT 6060 
GATGGCAATA ATATATGTAT TATGAAAATG AAGTTATGAT TTCCGATGAC CCTAAGTCCC 6120 
TTTCTTTGGT TAATO ATGAA ATTCCTTTGT GTGTGTTT 

Seg ID NO: 122 Protein sconertce: 
65 Protein Accession ft NP_C01845 



1 It 21 31 41 51 

70 MEPWSSRWKT KRWLVTOFTVT TLALTFLFQA REVRGAAPVD VLKALDFHNS PEGISKTTGF 60 
CTNRKNSCGS DTAYRVSKQA QLSAPTKQLF PGGTFPEDFS ILFTVKPKKG 1QSFLLSIYN 120 
EHGIQQIGVE VGRSPVFLFE DHTGKPAPED YPLFRTVNIA DGKWHRVAIS VEKKTVTMIV 180 
DCKKKTTKPL DRSERAIVDT NG1TVFGTRI LDBEVFEGDI QQFUTGDPK AAYDY CEHYS 240 
PDCPSSAPKA AQAQEPQIDE YAPEDHEYD YEYO EAEYKB AfcSVTEUHl V 1EB1J AQTEA 300 

75 NIVDDFQEYN YGTMESYQTBAPRHVSOTNBPNPVEEIFrE BYLTGEDYDS QRKNSEDTLY 360 
ENKEIBGRDS DLLVDGDLGE YDFYBYKEYE DKPTSPPNEE FGPGVPAETD lTETSINGHa 420 
AYGEKGQKGB PAWEPGMLV BGPPGPAGPA GIMGPPGLQG FTGFPGDPGD RGPPGRPGLP 480 
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G ADGLPGPPG TMLMLPFRYG GDGSKGPT1S AQEAQAQAIL QQAR1ALRGP PGPMGLTGRP 540 
GPVGGPGSSG AKGESGDPGP QGPRGVQGPP GPTGKPGKRG RPGADGGRGM PGEPGAXGDR 600 
GFDGLPGLPG DKGHRGERGP QGPPGPPGDD GMRGEDGHG PRGLPGEAGP RGLLGPROTP 660 
GAPGQPGMAG VDGPPGPKGN MGPQGBPGPP GQQGNPGPQG LPGPQGPlGP PGHKGPQGKP 720 
GLAGLPGADG PPGHPGKEGQ SGEfCGALGFP GPQGPIGXPG PRGVKGADGV RGLKGSKGEK 780 
GEDGFPGFKG DMGLKGDRGE VGQIGPRGXD GPEGPKGRAG PTGDPGPSGQ AGEK.GKLGVP 840 
GLPGYPGRQG PKGSTGFPGF PGANGEKG AR GVAGKPGPRG QRGPTGPRGS RGARGPTGKP 900 
GPKGTSGGDG PPGPPGERGP QGPQGPVGFP GPKGFPGPPG RMGCPGHPGQ RGBTGFQGKT 960 
GPPGPGGWG PQGPTGETGP IGERGYPGPP GPPGEQGLPG AAGKEGAKGD PGPQGISGKD 1020 
GPAGLRGFPG ERGLPGAQGA PGLKGGEGPQ GPPGPVGSPG ERGSAGTAGP IGLRGKPGPQ 1080 
GPPGPAGEKG APQEKGPQGP AGRDGVQGPV GLPGPAGPAG SPGEDGDKGB IGEPGQKGSJC 1140 
GGKGENGPPG FPGLQGPVGA PGIAGGDGEP GPRGQQGMFG QKGDEGARGF PGPFGPIGLQ 1200 
GLPGFPGEKG ENGDVGPWGP PGPPGPRGPQ GPNGADGPQG PPGSVGSVGG VGEKGEPGEA 1260 
GNPGPPGEAG VGGPKGERGE KGEAGPPGAA GPPGAKGPPG DDGPKGNPGP VGFPGDPGPP 1320 
GELGPAGQDG VGGDKGEDGD PGQPGPPGPS GEAGPPGPPG KRGPFGAAGA EGRQGEKGAK 1380 
GEAGAEGPPG KTGPVGPQGP AGKPGPEGLR GIPGPVGBQG LPGAAGODGP PGPMGPPGLP 1440 
GLKGDPGSKG EKGHPGLIGL IGPPGEQGEK GDRGLPGTQG SPGAKGDGGI PGPAGPLGPP 1500 
GPPGLPGPQG PKGNKGSTGP AGQKGDSGLP GPPGPPGPPG EVIQPLPILS SKKTRRHTEG 1560 
MQADADDNIL DYSDGMEEIF GSLNSLKQD1 EHMKFPMGTQ TNPARTCXDL QLSHPDPPDG 1620 
EYWIDPNQGC SGDSFKVYCN FTSGGHTOY PDKKSEGVRI SSWPKEKPGS WFSEFKRGKL 1680 
LSYLPVEGNS INMVQMTFLK. LLTASARQNF TYHCHQSAAW YDVSSGSYDK AL RFLG SNDB 1740 
EMSYDNNPFI KTLYDGCTSR KGYEKTVIEI NTFKIDQVPt VDVME5DFGD QNQKFGFEVG 1800 
PVCFLG 

Seq ID NO: 123 DNA sequence 

Nucleic Acid Accession ft NM 015886 

Coding sequence: 485*1261 (nndertined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

GAATTCCCCC CCCCCCCCCC TCACTTGGTG TGTCTATATG TCTGGCAGAC ATTATCAGCA 60 
CATTCTCTGT TGTTACCTGT GATTCATTTT TTCTTCACTC TCCAGGTGAA TTTCAATTGC 120 
TGAAAATTTC CCACTGAAAA TATOCAGTAA TATATTTTGT GGTTCAGACA TTTGGGGCAA 180 
ATGGTTCACA TTCATTTTAG GGTTAGTGGT CATGCTGTTT ATTTTTCTCT GCTATACAAA 240 
GTTCCTCTTA GGGGTCTGCC TCATGACACT AAAAAATGAA TAGAGATTCT ACTGTAGGTT 300 
ATCTCCTAGG CTTGAGTTCA ACATTTGTTT GGATTTTTQA AGAAAGTCAA ATCAAGCAAT 360 
GCTCCCAAAT GAll i TC l 1 l ii TAAATTCATA CCCTCTGGCC CTATTTTTTT TCATAGACCC 420 
TAACTCTACC TTTCTGCTTT AAAGCAAAGT AAACTCGGTG GOCTCTTCTT CTCCACCCCT 480 
CAAAATG ATA OCAATCTCTO CCGTCAGCAQ TGCACTCCTO TTCTCCCTTC TCTGTG AAGC 540 
AAGTACCGTC GTCCTACTCA ATTCCACTGA CTCATCCCCG CCAACCAATA ATTTCACTGA 600 
TATTGAAGCA GCTCTGAAAG CACAATTAG A TTCAGCGGAT ATCCCCAAAG CCAGGCGGAA 660 
GCGCTACATT TCGCAG AATO ACATGATCGC CATTCTTGAT TATCATAATC AAGTTCGGGG 720 
CAAAGTGTTC OCACCGGCAG CAAATATGGA ATATATGGTT TGGGATGAAA ATCTTGCAAA 780 
ATCGGCAGAG GCTTGGGCGG CTACTTGCAT TTGGGACCAT GGACCTTCTT ACTTACTGAG 840 
ATTTTTGGGC CAAAATCTAT CTGTACGCAC TGGAAGATAT CGCTCTATTC TCCAGTTGGT 900 
CAAGCCATGG TATGATGAAG TGAAAGATTA TGCI11IUJA TATCCCCAGG ATTGCAACCC 960 
CAGATGTCCT ATGAGATOTT TTGGTCOCAT GTGCACACAT TATAOGCAGA TGGTTTGGGC 1020 
CACTTCCAAT CGG ATAGGAT GCGCAATTCA TGCTTGOCAA AACATGAATG TTTGGGGATC 1080 
TGTGTGGCGA CGTGCAGTTT ACTTGGTATG CAACTATGCC CCAAAGGGCA ATTGGATTGG 1140 
AGAAGCACCA TATAAAGTAG GGGTAOCATG TTCATCTTGT CCTOCAAGTT ATGGGGGATC 1200 
TTGTACTGAC AATCTGTGTT TTCCAGGAGT TACGTCAAAC TACCTGTACT GGTTTAAATA 1260 
AGTTTACCTT TTOCTCCAGG AAATATAATG ATTTCTGGGA ACATGGGCAT GTATATAT AT 1320 
ATATGGAGAG AGAATTTTGC ACATATTATA CATATTTTGT GCTAATCTTG TTTTCCTCTT 1380 
AGTATTCCrr TGTATAAATT AGTGTTTGTC TAGCATGTTT GTTTAATCCT TTGGG AATTC 

Seq ID NO: 124 Protem sequence: 
Protein Accession NP_056970.1 

1 11 21 31 41 51 
I 1 I 1 I I 

MIAKAVSSA LLFSLLCEAS TWLLNSTDS SPPTNNFTDI BAALKAQLDS ADIPKARRKR 60 
YBQNDMIAI LDYHNQVRGK VFPPAANMEY MVWDENLAKS AEAWAATCIW DHGPSYLLRF 120 
LGQNLSVRTG RYRSILQLVK PWYDEVKDYA FPYPQDCNPR CPMRCFGPMC THYTQMVWAT 180 
SNRIGCAIHA CONMNVWGSV WRRAVYLVCN YAPKGNWIGE APYKVGVPCS SCPPSYGGSC 240 
TDNLCFPG VT SNYLYWFK 



Seq ID NO: 125 DNA sequence 

Hucksc Acid Accession ft NMJ0O1793 

Coding sequence: 54-2543 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

GCGGAACACC GGCCCGCCGT CGCGGCAGCT GCTTCACCCC TCTCTCTGCA GCCAJGGGGC 60 
TCCCTCGTGG ACCTCTCGCG TCTCTCCTCC TTCTCCAGGT TTGCTGGCTG CAGTGCGCGG 120 
CCTCCGAGCC GTGCCGGGCG GTCTTCAGGG AGGCTGAAGT GACCTTGGAG GCGGGAGGCG 180 
OGGAGCAGGA GCCCGGCCAG GCGCTGGGGA AAGTATTCAT GGGCKJCCCT GGGCAAGAGC 240 
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CAGCTCTOTT TAOCACTG AT AATG ATG ACT TCACTGTGCG GAATGGCGAO ACAGTCCAGG 300 
AAAGAAGGTC ACTGAAGGAA AGGAATCCAT TGAAGATCTT CCCATCCAAA CGTATCTTAC 360 
GAAGACACAA GAGAGATTGG GTGGTTGCTC CAATATCTGT CCCTGAAAAT GGCA AGGGTC 420 
CCTTCCCCCA G AGACTGAAT CAGCTCAAGT CTAATAAAGA TAGAGACACC AAGATTTTCT 480 
5 ACAGCATCAC GGOGCCOGGG GCAGACAGCC CCCCTGAGGG TGTjCTTOGCT GTAGAGAAGG 540 
AGACAGGCTG GTTGTTGTTG AATAAGCCAC TGGACCGGGA GG AGATTGCC AAGTATGAGC 600 
TCTTTGGCCA CGCTGTGTCA GAGAATGGTG CCTCAGTGGA GGACCCCATG AACATCTCCA 660 
TCATCGTOAC OGACCAGAAT GACCACAAGC CCAAGTTTAC CCAGGACACC TTCCGAGGG A 720 
GTGTCTTAGA GOGAOTCCTA CCAGGTACTT CTGTG ATGCA GGTGACAGCC ACAGATGAGG 780 

10 ATGATGCCAT CTACACCTAC AATGGGGTGG TTGCTTACTC CATCCATAGC CAAGAACCAA 840 
AGGACCCACA CGACCTCATC TTCACAATTC ACCGGAGCAC AGGCACCATC AGOGTCATCT 900 
CCAGTGGCCT GGACOGGGAA AAAGTCOCTG AGTACACACT GACCATCCAG GCCACAG ACA 960 
TGGATOGGGA CGGCTCCACC ACCACGGCAG TGGCAGTAGT GGAGATCCTT GATGCCAATG 1020 
ACAATGCTCC CATGTTTGAC CCCCAGAAGT ACGAGGCCCA TGTGCCTGAG AATGCAGTGG 1080 

15 GCCATGAGGT GCAGAGGCTG ACGGTCACTG ATCTGGACGC CCCCAACTCA CCAGCGTOGC 1 1 40 
GTGCCACCTA CCTTATCATG GGCGGTG ACG ACGGGGACCA TTTTACCATC ACCACCCACC 1200 
CTGAGAGCAA CCAGGGCATC CTGACAACCA GGAAGGGTTT GGATTTTGAG GCCAAAAACC 1260 
AGCACAOCCT GTACOTTGAA GTGACCAACG AGGOOOtTIl UilGCl OAAO CTCCCAACC T 1320 
CCACAGCCAC CATAQTGGTC CACGTGGAGG ATGTQAATOA GQCACLWfli TTTgTOCCAC 1380 

20 CCTCCAAAGT CGTTGAGGTC CAGGAGGGCA TCCCCACTGG GGAGCCTGTG TGTGTCTACA 1440 
CTGCAGAAGA CCCTGACAAG GAGAATCAAA AGATCAGCTA CCGCATCCTG AGAG AOCCAG 1500 
CAGGGTGGCT AGCCATGGAC CCAGACAGTG GGCAGGTCAC AGCTGTGGGC ACCCTCGACC 1560 
GTGAGG ATGA GCAGTTTGTG AGGAACAACA TCTATGAAGT CATGGTCTTG GCCATGGACA 1620 
ATGGAAGCCC TCCCACCACT GGCACGGGAA CCCTTCTGCT AACACTG ATT GATGTCAACG 1680 

25 ACCATGGCCC AGTCCCTGAG CCCCGTCAGA TCACCATCTG CAACCAAAGC CCTGTGCGCC 1740 
ACGTGCTGAA CATCACGGAC AAGGACCTGT CTCCCCACAC CTCCCCTTTC CAGGCOCAGC 1800 
TCACAGATQA CTCAGACATC TACTGGACGO CAGAGGTCAA CGAGGAAGGT OACACAGTGG 1860 
TCTTGTCCCT GAAGAAGTTC CTGAAGCAGG ATACATATGA CGTGCACCTT TCTCTGTCTG 1920 
ACCATGGCAA CAAAGAGCAG CTGACGGTGA TCAGGGCCAC TGTGTGCGAC TGCCATGGCC 1980 

30 ATGTCGAAAC CTGCCCTGGA CCCTGGAAAG GAGGTTTCAT CCTCCCTGTG CTGGGGGCTG 2040 
TCCTGGCTCT GCTGTTCCTC CTGCTGGTGC TG U11 IIOU GGTGAGAAAG AAGCGGAAGA 2100 
TCAAGGAGCC CCTCCTACTC CCAGAAGATG ACAGCCCTGA CAACGTCTTC TACTATGGCG 2160 
AAGAGGGGGG TGGCGAAGAG GAGCAGGACT ATGACATCAC CCAGCTOCAC CGAGGTCTGG 2220 
AGGCCAGGCC GGAGGTGGTT CTCCGCAATG ACGTGGCACC AACCATCATC CCGACACCCA 2280 

35 TGTACCGTCC TAGGCCAGCC AAOCCAGATG AAATCGGCAA CTTTATAATT GAGAACCTGA 2340 
AGGCGGCTAA CACAGACCCC ACAGCCCCGC CCTACGACAC CCTCTTGGTG TTCGACTATG 2400 
AGGGCAGCOG CTCCGACGCC GCGTCCCTOA GCTCCCTCAC CTCCTCCGCC TCCGACCAAG 2460 
ACCAAOATTA CGATTATCTG AACGAGTGGG GCAGCCGCTT CAAGAAGCTG GCAGACATGT 2520 
ACGGTGGCGG GGAGGACGAC JASGCGGCCT GCCTGCAGGG CTGGGG ACCA AACGTCAGGC 2580 

40 CACAGAGCAT CTCCAAGGGG TCTCAOTTCC CCCnCAOCTOAGGACTTCG OAGCTTOTCA 2640 
GGAAOTQGCC GTAGCAACTT GGCGGAGACA OGCTATQAGT CTGACGTTAG ACiltiGl 1 OCT 2700 
TCCTTAGCCT TTCAGGATGG AGGAATGTGG GCAGTTTGAC TTCAGCACTG AAAACCTCTC 2760 
CACCTGGGCC AGGGTTGCCT CAGAGGCCAA GTTTCCAGAA GCCTCTTACC TGCCGTAAAA 2820 
TGCTCAACOC TGTGTOCTGG GCCTOGGOCT GCTGTGACTG ACCTACAGTG GACTTTCTCT 2880 

45 CTGGAATGGA ACCTTCTTAG GCCTCCTGGT GCAACTTAAT 1 1 J J 1 1 1 J J 1 AATGCTATCT 2940 

TCAAAACGTT AGAGAAAGTT CTTCAAAAGT GCAGCCCAQA GCTGCTGGGC CCACTGGCCG 3000 
TCCTGCATTT CTGGTTTCCA GACCCCAATG CCTCCCATTC GGATGGATCT CTGCGTTTTT 3060 
ATACTGAOTG TGCCTAGGTT GCCCCTTATT TTTTATTTTC C CTGTTGOGT TGCTATAG AT 3120 
G AAGGGTGAG GACAATCGTG TATATGTACT AGAACTTTTT TATTAAAGAA A 

50 

SeqlDNO: 126 Prorcro seqnencc: 
Protein Accession #: NP_001784 

1 11 21 31 41 51 

55 | i i i I I 

MGLPRGPLAS LLLLQVCWIjQ CAASEPCRAV FRBAEVTLEA GGAEQEPGQA LGKVFMGCPG 60 
QEPALFSTDN DDFTVRNQET VQERRSLKER NPUOFPSKR ILRRHKRDWV VAPISVPENO 120 
KGPFPQRLNQ UCSNKDRDTK JJFYSITGPGA DSPPEGVFAV BKETGWLLLN KPLDREELAK 180 
YELFGHAVSE NGASVEDPMN ISIIVTDQND HKPKFTQDTF RGSVLEGVLP GTSVMQVTAT 240 

60 DEDDAIYTYN GWAYSIHSQ EPKDPHDLMF TIHRSTGTIS V1SSGLDREK VPHYTLTIQA 300 

TDMDGDGSTT TAVA WHLD ANDNAPMFDP QKYEAHVPEN AVGHBVQRLT VTDLDAPNSP 360 
A WRATYLJMG GDDGDHFTTT THPESNQGBL TTRKG1DFBA KNQHTLYVEV TNEAPFVLKL 420 
PTSTATIWH VEDVNHAPVF VPPSKWEVQ EGIPTGBPVC VYTAEDPDKE NQKISYRILR 480 
DPAGWLAMDP DSGQVTAVGT LDREDEQFVR NNIYBVMVLA MDNGSPPTTG TGTLLLTLID 540 

65 VNDHGPVPEP RQmCNQSP VRHVLNTTDK DLSPHTSPFQ AQLTDDSDIY WTABVHEBGD 600 
TWLSLKKFL KQDTYDVHLS LS3HGNKHJL TVIRATVCDC HGHVETCPGP WKGGFILPVL 660 
GAVLALLFLL LVLLLLVRKK RKKEPLLLP EDDTRDNVFY YGHEGGGBED QDYDITQLHR 720 
GLEARFEWL RNDVAPTIIP TPMYRPRPAN PDHGNFUH NLKAAKTDFT APPTOTIXVF 780 
DYEGSGSDAA SLSSLTSSAS DQDQDYDYLN EWGSRFKKLA DMYGGGEDD 

70 

SeqlDNO: 127 DNA sequence 

Modoc Acid Accession & NM 003256.1 

Coding sequence: 60-734 (anderiined sequences correspond to stait aod stop codons) 

75 1 11 21 31 41 51 
1 I I I I I 

CCTGCTGGGG CCGTCCAGTC CCCCAGACCT CACAGGCTCA GTCGCGGATC TGCAGTGTCA 60 
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JGCCTGGGAG CCCTCGGCCC GCGCCAAGCT GGGTGCTGTT GCTGCGGCTG CTGGCGTTGC 120 
TGCGGCCCCC GGGGCTGGGT GAGGCATGCA GCTGCGCCCC GGCGCACCCT CAGCAGCACA 180 
TCTGCCACTC GGCACTTGTG ATTOGGGCCA AAATCTOCAG TGAG AAGGTA GTTCCGGCCA 240 
GTGCAGACCC TGCTGACACT GAAAAAATGC TCCGGTATG A AATCAAACAG ATAAAGATGT 300 
TCAAAGGOTT TGAGAAAGTC AAGGATGTTC AGTATATCTA TACGCCTTTT GACTCTTCCC 360 
TCTGTGGTGT GAAACTAGAA GCCAACAGCC AGAAGCAGTA TCTCTTGACT GGTCAGGTCC 420 
TCAGTGATGG AAAAGTCTTC ATCCATCTOT GCAACTACAT CXJAGCCCTGG GAGGACCTGT 480 
CCTTGGTGCA G AGGGAAAGT CTGAATCATC ACTACCATCT G AACTGTGGC TGCCAAATCA 540 
CCACCTGCTA CACAGTACCC TGTACCATCT CGGCCCCTAA CGAGTGCCTC TGGACAGACT 600 
GGCTGTTGGA ACGAAAGCTC TATGGTTAOC AGGCTCAGCA TTATGTCTGT ATGAAGCATG 660 
TTGAGGGCAC CTGCAGCTGG TACOGGGGCC ACCTGCCTCT CAGGAAGGAG TTTGTTGACA 720 
TCGTTCAGCC CTAGTAGGGA CCAGTGACCA TCACATCCCT TCAAGAGTCC TGAAGATCAA 780 
GCCAGTTCTC CTTCCCTGCA GAGCTTTGGC CATTAGCACC TGACCTCTTG CTGCCAGCTA 840 
ATAAGAAGTG CCAAGTGGAC AGTCTGGCCA CTGTCAAGGC AGGGAAGGGG CCATGACTTT 900 
TCTGCCCTGC CCTCAGCCTG TTGCCCTGCC TCCCAAACCC CATTAGTCTA GCCTTGTAGC 960 
TGTTACTGCA AGTGTTTCTT CTGGCTTAGT CTGTTTTCTA AAGCCAGGAC TATTCCCTTT 1020 
CCTCCCCAGG AATATGTGTT TTCCTTTGTC TTAATCG ATC TGGTAGGGG A GAAATGGCGA 1080 
ATGTCATACA CATGAGATGG TATATCCTTG CGATGTACAG AATCAGAAGG TGGTTTGACA 1140 
GCATCATAAA CAGGCTGACT GGCAGGAATG AAAAAAAAAA AAAAAAAAA 

SeqIDNO: 128 Protein sequence; 
Protein Accession #: NP 003247.1 

1 II 21 31 41 51 
1 1 1 I 1 I 

MPGSPRPAPS WVLIXRULAL LRPPGLGEAC SCAPAIIPQQH ICHSALVIRA K1SSEKWPA 60 
SADPADTEKM LRYBDCQKM FKGFEKVKDV QYIYTPFDSS LCG VKLEANS QKQYLLTGOV 120 
LSDGKVFIHL CNYIEPWEDL SLVQRESLNH HYHLNCGCQI TTCYTVPCTI SAFNECLWTD 180 
WLLERKLYGY QAOHYVCMKH VDGTCSWYRG HLPLRKEFVD IVQP 

SeqIDNO: 129 DNA sequence 

Nucleic Add Accession #: NM_007207.2 

Coding sequence: 143-1591 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

CCACGCGTCC GCAATGAAGC CGAGTGAATG GGGGCTGAAT GTGCGAGTCC ATAGCTGAAG 60 
AGOAGCGCCA OATGGTGGAG GAATACAC TT ATTTATGA AA CTOTCTTGAG TTCTTCTTQA 120 
A1TGCCAG1 1 1 XCACCCTCC TCATGCCTCC GlUilXJl 1 1 A GAOGAGAGGG TAGTAGTGGC 180 
ACTATCTAGG CCCGTCCG AC CTCAGGATCT CAACCTTTGT TTAGACTCTA GTTACCTTGG 240 
CTCTGCCAAC CCAGGCAGTA ACAGCCACCC TCCTGTCATC GCCACCACCG TrGTGTCCCT 300 
CAAGGCTGCG AATCTGAOGT ATATGCCCTC ATCCAGCGGC TCTGCCCGCT CGCTG AATTG 360 
TGGATGCAGC AGTGCCAGCT GCTGCACTGT GGCAACCTAC GACAAGGACA ATCAGGCCCA 420 
AACCCAAGCC ATTGCCGCTG GCACCACCAC CACTGCCATC GG AACCTCTA GCAOCTGCOC 480 
TGCTAACCAG ATGGTCAACA ATAATGAGAA TACAGGCTCT CTAAGTCCAT CAAGTGGGGT 540 
GGGCAGCCCT GTGTCAGGG A CCCCCAAGCA GCTAGCCAGC ATCAAAATAA TCTACCCCAA 600 
TGACTTGGCA AAGAAGATGA CCAAATGCAG CAAGAGTCAC CTGCCGAGTC AGGGCCCTGT 660 
CATCATTG AC TGCAGGCCCT TCATGGAGTA CAACAAGAGT CACATCCAAG G AGCTGTOCA 720 
CATTAACTGT GCCGATAAGA TCAGCCGGCG GAG ACTGCAG CAGGGCAAGA TCACTGTCCT 780 
AGACTTGATT TCCTGTAGGG AAGGCAAGGA CTCTTTCAAG AGGAlCllll CCAAAGAAAT 840 
TATAGTTTAT GATOAGAATA CCAATGAACC AAGCCGAGTG ATGCCCTCCC AGCCACTTCA 900 
CATAGTCCTC GAGTCCCTGA AGAGAGAAGG CAAAGAACCT CTGGTGTTGA AAGGTGGACT 960 
TAGTAGTTTT AAGCAGAACC ATGAAAACCT CTGTGACAAC TCCCTCCAGC TCCAAGAGTG 1020 
CCGGGAGGTG GGGGGCGGCG CATCCGCGGC CTCGAGCTTG CTACCTCAGC CCATCCCCAC 1080 
CACCCCTGAC ATCGAGAACG CTGAGCTCAC CCCCATCTTG CCCTTCCTGT TCCTTGGCAA 1140 
TGAGCAGGAT GCTCAGGACC TGGACACCAT GCAGCGGCTG AACATCGGCT ACGTCATCAA 1200 
CGTCACCACT C^TCTTCCCC TCTACCACTA TGAG AAAGGC CTGTTCAACT ACAAGCGGCT 1260 
GCCAGCCACT G ACAGCAACA AGCAG AACCT G CGGCAGTAC TTTGAAG AGO CTTTTG AGTT 1320 
CATTGAGGAA GCTCACCAGT GTGGGAAGGG GCTTCTCATC CACTGCCAGG CTGGGGTGTC 1380 
CCGCTCCGCC ACCATCGTCA TCGCTTACTT G ATGAAGCAC ACTCGGATGA CCATGACTGA 1440 
TGCTTATAAA TTTGTCAAAG GCAAACGACC AATTATCTCC CCAAACCTTA ACTTCATGGG 1500 
GCAGTTGCTA G AGTTCGAGG AAGACCTAAA CAACGGTGTG ACACCGAG AA TCCTTACACC 1560 
AAAGCTGATG GGCGTGGAGA CGOTTOT GTQ AC AATGGTCT GGATGGAAAG GATTGCTGCT 1620 
CTCCATTAGG AG ACAATGAG G AAGG AGGAT GGATTCTGGT 1111 1 1 JL1 1 1C11111AU 1680 
TTGTAGTTGG GAGTAAAGTT TGTGAATGGA AACAAACTTO GTTAAACACT TTATTTTTAA 1740 
CAAGTGTAAG AAGACTATAC TTTTGATGCC ATTGAGATTC ACCTTCCACA A ACTGGCCAA 1800 
ATTAAGGAGG TTAAAGAAGT AATTTTTTTT AAGCCCAACC ATTAAAAATT TAATACAACT I860 
TGGTTTCTCC CCCTTTTTCC TTTAAAGCTA NTTTGTAAAA GTTTATGAG 

SeqIDNO: 130 Protein sequence: 
Protein Accession #: NP_009138.1 

I 11 21 31 41 51 
I I I I I I 

MPPSPLDDRV WALSRPVRP QDLNLCLDSS YLGSANPGSN SHPPVIATTV VSLKAANLTY 60 
MPSSSGSARS LNCGCSSASC CTVATYDKDN QAQTQAIAAG TTTTAIGTST TCPANOMVNN 120 
NENTGSLSPS SGVGSPVSGT PKQLASIKII YPNDLAKXMT KCSKSHLPSQ GPVIIDCRPF 180 

406 



WO 02/059377 



PCT/US02/02242 



MEYNKSKQG AVHINCADKI SRRRLQQGKI TVLDLISCRE GKDSFKRIFS KEHVYDENT 240 
NEPSRVMPSQ PLHTVLESliC REGKEPLVLK GGLSSFKQNH ENLCDNSLQL QECREVGGGA 300 
SAASSLLPQP 1PTTPD1ENA ELTPILPPLF LGNBQDAQDL DTMQRLNIQY VTNVTTHLPL 360 
YHYEKGLFNY KRLPATDSNK QNLRQYFHEA FEF1EEAHQC GKGLUHCQA GVSRSATTVI 420 
5 AYLMKHTRMT MTDAYKFVKG KRPUSPNLN FMGQ1XHFEE DLNNGVTPR1 LTPKLMGVET 480 
W 

ScqIDNai3l DMA sequence 
NncWc Add Accession*: NM_005409J 
10 Coding sequence 94-378 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

TTCXHTTCAT GTTCAGCATT TCTACTCCIT CCAAGAAGAG CAGCAAAGCT GAAGTAGCAG 60 

15 CAACAGCACC AGCAGCAACA GCAAAAAACA AACATGAGTG TGAAGGGCAT GGCTATAGCC 120 
TTGGCTGTGA TATTGTGTGC TACAGTTGTT CAAGGCTTCC CCATGTTCAA AAGAGQACGC 180 
TGTCTTTCCA TAGGCCCTGG GGTAAAAGCA GTGAAAOTGG CAOATATTGA GAAAGCCTCC 240 
ATAATGTACC CAAGTAACAA CTGTGACAAA ATAGAAGTGA TTATTAOCCT GAAAGAAAAT 300 
AAAGGACAAC G ATGCCTAAA TCCCAAATCG AAGCAAGCAA GGCTTATAAT CAAAAAAGTT 360 

20 GAAAGAAAGA ATTTTTAAAA ATATCAAAAC ATATGAAGTC CTGGAAAAGG GCATCTGAAA 420 
AACCTAGAAC AAGTTTAACT GTGACTACTG AAATGACAAG AATTCTACAG TAGGAAACTG 480 
AGAcnncr atggttttgt gactttcaac ttttgtacag TTATGTGAAG GATGAAAGGT S40 
GGGTGAAAGG ACCAAAAACA GAAATACAGT CTTCCTGAAT GAATGACAAT CAOAATTCCA 600 
CTGCCCAAAG GAGTCCAGCA ATTAAATGGA TTTCTAGGAA AAGCTACCTT AAG AAAGGCT 660 

25 GGTTACCATC GQAGTTTACA AAGTOCTTTC ACGTTCTTAC TTGTTGTATT ATACATTCAT 720 
GCATTTCTAG GCTAG AGAAC CTTCTAGATT TGATGCTTAC AACTATTCTG TTOTOACTAT 780 
GAGAACATTT CTGTCTCTAG AAGTTATCTG TCTGTATTGA TCTTTATGCT ATATTACTAT 840 
CTOTOOTTAC AQTGOAQACA TTOACATTAT TACTGGAGTC AAGOOCTTAT AAGTCAAAAG 900 
CATCTATOTG TCGTAAAGCA TTCCTCAAAC Al l 1T11CAT GCAAATACAC ACT1 U111CC 960 

30 CCAAATATCA TGTAGCACAT CAATATGTAG GGAAACATTC TTATGCATCA TTTGGTTTGT 1020 
TTTATAACCA ATTCATTAAA TGTAATTCAT AAAATGTACT ATQ AAAAAAA TTATACGCTA 1080 
TGGGATACTQ GCAACAOTOC ACATATTTCA TAACCAAATT AGCAGCACCG GTCTTAATTT 1140 
OAT UITlHC AACTTTTATT CATTGAGATO TTTTGAAGCA ATTAGGATAT GTGTGTTTAC 1200 
TGTACTTTTT GTTTTGATCC GTTTGTATAA ATGATAGCAA TATCTTGGAC ACATTTGAAA 1260 

35 TACAAAATOT TTTTGTCTAC CAAAGAAAAA TGTTGAAAAA TAAGCAAATG TAT ACCTAG C 1320 
AATCACTTTT A C11I11G TA ATTCTGTCTC TTAGAAAAAT ACATAATCTA ATCAATITCT 1380 
TTGTTCATGC CTATATACTG TAAAATTTAG GTATACTCAA GACTAGTTTA AAGAATCAAA 1440 
GTCATTTTTT TCTCTAATAA ACTACCACAA CCTTTCTTTT TTAAAAAAAA AAA 

40 SeqlDNCh 132 Protein seonence: 

Protein Accessions: NP005400.1 

1 11 21 31 41 51 

45 MSVKGMA1AL AVIU^ATWQ GFPMFKRGRC LCIGPGVKAV KVADIEKASI MYPSNNCDKI 60 
EVHTUCENK GQRCLNPKSK QARLUKKVE RKNF 



Scq ID NO. 133 DMA sequence 

Nnctefc Acid Accession #: NM 012342 I 

50 Q)ding sequence 373-1 155 (omledniedseqiKDCCSConeapoiidtD $ 

1 11 21 31 41 51 

CTGGCGCGGG OGGGAGCTGC GGCGG ATAOC CITGCGTGCT GTGGAGACCC TACTCTCTTC 60 
55 GCTGAGAACG GCCGCTAGCX5 GGGACTGAAG GCCGGGAGCC CACTCOCGAC CCGGGGCTAG 120 

CGTOCGTCOC TAGAGTCGAG CGGGGCAAGG GAGCCAGTGG CCGCCGACGG GGGACCGGGA 180 

AACTTTTCTG GGCTOCTGGA GAGCCCTGTA GCCGCGCTCC ATGCTOOGGC AGCGGOCCGA 240 

AA<XCAGCCC CGCCGCTGAC GGAGCCCGOC GCTCCGGGCA GGGOOCATGC CCTGCGCGCT 300 

CCGGGGGTCG TAGCTGCCGC CGAGCCGGGG CTCCGGAAGC CGGOGGGGGC GCCGCGGCCG 360 
60 TGOGGGOOGT CAATGGATCG CCACTCCAGC TACATCTTCA TCTGOCTOCA GCTGGAGCTC 420 

TGCCCCATGG COGTGCTGCT CACCAAAGQT GAAATTOGAT GCT ACTGTGA TGCTGCCCAC 480 

TGTGTAGCCA CTGGTTATAT OTOTAAATCT GAGCTCAGCG CCTQCTTCTC TAGACTTCIT 540 

GATCCTCAGA ACTCAAATTC CCCACTCACC CATGGCTGOCTGGACTCTCT TGCAAGCACG 600 

ACAGACATCT GCCAAGOCAA ACAGGCCCGA AACCACTCTG OCACCAOCAT ACCCACATTG 660 
65 G AATGCTGTC ATGAAGACAT GTGCAATTAC AG AGGGCTOC AOGATGTTCT CTCTCCTCCC 720 

AGGGGTG AGO CCTCAGGACA AGGAAACAGG TATCAGCATG ATGGTAGCAG AAACCITATC 780 

ACCAAGGTGC AGGAGCTGAC TTCTTCCAAA GAGTTGTGGT TCCGGGCAGC GGTCATTGCC 840 

GTGCCCATTG CTGGAGGGCT GATTTTAGTG TTGCTTATTA TGTTGGCCCT GAGGATGCTT 900 

CG AAGTG AAA ATAAG AGGCT GCAGG ATCAG CGGCAACAG A TGCTCTCCCG TTTGCACTAC 960 
70 AGCnrCAOG GACACCATTC CAAAAAGGGG CAGGTTGCAA AGTTAOACrT GGAATGCATG 1020 

GTGCCGGTCA GTGGGCACGA GAACTGCTOT CTGACCTGTG ATAAAATGAG ACAAGCAGAC 1080 $ 

CICAGCAACG ATAAO ATCCT CTCGCTTGTT CACTGGGGCA TGTACAGT6G GCACGGGAAG 1140 

CTGGAATTCG TATGACGGAG TCTTATCTGA ACTACACTTA CTGAACAGCT TGAAGGOCTT 1200 

TTQAGTTCTG CTGGACAGGA GGACTTTATC TOAAGACAAA CTCATTTAAT CATCTTTGAG 1260 
75 AGACAAAATG ACCTCTGCAA ACAOAATCTT OGAT A 1 1 1C1 1 C l bA AGGAT TATTTGCACA 1320 

GACTTAAATA CAGTTAAATG TGTTATTTGC TTTTAAAATT ATAAAAAGCA AAGAGAAGAC 1380 

TTTGTACACA CTGTCACCAG GGTTATTTGC ATCCAAGGGA GCTGGAATTG AGTACCTAAA 1440 
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TAAACAAAAA TOTOCCCTAT GTAAGCTTCT ACATCTTG AT TTATTGTAAA GATTTAAAAG 1500 
AAATATATAT ATTTTGTCTG A 

Seq ID NO 134 Protein seaoencc: 
5 Protem Access** fc NPG36474.I 

1 11 21 31 41 51 

MDRHSSYIFI W I^LEuUmaWtKGEIRC YCDAAHCVAT GYMOCSELSA CFSRLLDPQN 60 
10 SNSPLTHGCL DSLASTTDIC QAKQARNHSG TTIPTLECCH EDMCNYRGLH DVLSPPRGEA 120 
SGOGNRYQHD GSRNLITKVQ ELTSSKELWF RAAV1AVP1A GGULVLUM LALRMLRSEN 180 
KRLQDQRQQM LSRLHYSFHO HHSKKGQVAK LDLECMVPVS GHENCCLTCD KMRQADLSND 240 
RILSLVHWGM YSGHGKUEFV 

15 Seq ID NO: 135 DNA sequence 

Nucleic Add Accession & NMJD01627.1 

Coding sequence: 64-1815 (underifoed sequew» ccntsporri ^ 

1 II 21 31 41 51 

^ CGGGACGACG CCCCCTCCTG CGGCGTGGAC TCCGTCAGTG GCCCACCAAG AAGGAGGAGG 60 
AATATGOAAT CCAAGGGGGC CAGTTCCTGC CGTCTGCTCT TCTGCCTCTT GATCTCCGCC 120 
A(X«TCTTCA GGCCAGGCCT TGGATGGTAT ACTGTAAATT CAGCATATGG AG ATACCATT 180 
ATCATACCTT GCCGACTTGA CGTACCTCAG AATCTCATGTTTGGCAAATG GAAAJATOAA 240 

25 AAGCCCGATG GCTCCCCAGT ATTTATTGCC TTCAGATCCT CTACAAAGAA AAGTGTGCAG 300 
TACGACGATG TACCAG AATA CAAAGACAG A TTGAACCTCT CAGAAAACTA CACTTTGTCT 360 
ATCAGTAATG CAAGGATCAG TGATGAAAAG AGATTTGTGT GCATGCTAGT AACTGAGGAC 420 
AACGTGTTTG AGGCACCTAC AATAGTCAAG GTGTTCAAGC AACCATCTAA AOCTCAAATT 480 
GTAAGCAAAG CACTGTTTCT CGAAACAGAG CAGCTAAAAA AGTTGGGTGA CTGCATTTCA 540 

30 GAAGACAGTT ATCCAGATGG CAATATCACA TGGTACAGGA ATGGAAAAGT GCTACATCTC 600 
CTTGAAGGAG CGGTGGTCAT AATTTTTAAA AAGGAAATGG ACCCAGTGAC TCAGCTCTAT 660 
ACCATGACTT CCACCCTGGA GTACAAGACA ACCAAGGCTG ACATACAAAT GCCATTCACC 720 
TGCTCGGTGA CATATTATGG ACCATCTGGC CAGAAAACAA TTCATTCTGA ACAGGCAGTA 780 
TTTGATATrr ACTATCCTAC AGAGCAGGTG ACAATACAAG " K *^ G ^^9£^ A ^^9?^ ** 

35 ATCAAAGAAG GGGATAACAT CACTCTTAAA TGCTTAGGGA ATGGCAAC^ TOCCCCAGAG 900 
GAATTTTTGT TTTACTTACC AGG ACAGCCC GAAGGAATAA GAAGCTCAAA TACTTACACA 960 
CTGATGGATG TGAGGCGCAA TGCAACAGGA GACTACAAGT GTTCCCTG AT AGACAAAAAA 1020 
AGCATGATTO CTTCAACAGC CATCACAGTT CACTATTTGG ATTTGTCCTT AAAOCCAAGT^ 1080 
GGAGAAGTGA CTAG ACAGAT TGGTGATGCC CTACCCGTGT CATGCACAAT ATCTGCTAGC 1140 

40 AGGAATGCAA CTGTGGTATG GATGAAAG AT AACATCAGGC TTCGATCTAG CCCGTCATTT 1200 
TCTAGTCTTC ATTATCAGGA TGCTGGAAAC TATGTCTGCG AAACTGCTCT "CCACC I AGGTT 1260 
GAAGGACTAA AGAAAAGAGA GTCATTGACT CTCATTGTAG AAGGCAAACC TCAAATAAAA 1320 
ATGACAAAGA AAACTGATCC CAGTGGACTA TCTAAAACAA TAATCTGCCA TGTGGAAGGT 1380 
TTTCCAAAGC CAGCCATTCA GTGGACAATT ACTGGCAGTG GAAGCGTCAT AAACCAAACA 1440 

45 GAGGAATCTC CTTATATTAA TGGCAGGTAT TATAGTAAAA TTATCATTTC CCCTGAAGAG 1500 
AATGTTACAT TAACTTGCAC AGCAGAAAAC CAACTGGAGA GAACAOTAAA CTCCTTGAAT 1560 
GTCTCTOCTA TAAGTATTCC AGAACACGAT OAGGCAGACG AGATAAGTGA TGAAAACAGA 1C0 
GAAAAGGTGA ATGACCAGGC AAAACTAATT GTGGGAATCG ^^^^^^^^J^J Jf* 
GCCCTTGTTG CTGGTGTCGT CTACTGGCTG TACATGAAGA AGTCAAAGAC TG^TCAAAA 1740 

50 CATGTAAACA AGGACCTCGG TAATATOGAA GAAAACAAAA AGTTAGAAGA AAAOVATCAC 1800 
AAAACTGAAG CCTAAGAGAG AAACTGTCCT AGTTGTCCAG AGATAAAAAT CATATAG ACC I860 
AATTGAAGCA TGAACGTGGA TTGTATTTAA GACATAAACA AAGACATTGA CAGCAATTCA 1920 
TGGTTCAAGT ATTAAGCAGT TCATTCTACC AAGCTGTCAC AOO TTTTC AC iAGAATTATCT 1980 
CAAGTAAAAC AAATGAAATT TAATTACAAA CAATAAG AAC AAGTTTTGGC AGCCATGATA 2040 

55 ATAGGTCATA W ll tflUUI GGTTCAATTT TTTTTCOGTA AATCTCTGCA C™™*™? 

CITTTTGGTT TGCCTTTTAT GTAAATTTTT TACGTAGCTA TTTTTA TACA CTGTAAGCTT 2160 
TGTTCTOGGA GTTGCTGTTA ATCTGATGTA TAATGTAATG TTTTTATTTC AATTGTTTAT 2220 
ATGGATAATC TO AGCAGGTA CATTTCTG AT TCTGATTGCT ATCAGCAATG CCCCAAACTT 2280 
TCTCATAAGC ACCTAAAACC CAAAGGTGGC AGCTTGTGAA GATTGGGGAC ACTCATATTG 2340 

60 CCCTAATTAA AAACTGTGAT TTTTATCACA AGOG AGGGGA GGCCGAG AGT CAG ACTG ATA 2400 
GACACCATAG GAGCCOACTC TTTG ATATGC CACCAGCGAA CTCTCAGAAA TAAATCACAG 2460 
ATGCATATAG ACACACATAC ATAATGGTAC TCCCAAACTG ACAATTTTAC CTATTCTG AA 2520 
AAAGACATAA AACAGAATT 

65 Seq ID NO: 136 Pfotejn sequence; 

Pcotein Accessions NPJD0I6I8.1 

1 II 21 31 41 51 

70 MESXGASSCR LLFCLLtSAT VFRPGLGWYT VNSAYGDTI1 IPCRLDVPON LMFGKWKYEK 60 
PDGSPVF1AF RSSTKKSVQY DDVPEYKDRL NLSHNYTLSI SNARKDEKR FVCMLVTEDN 120 
VFBAPnVKV FKQPSKPHV SKALFLETEQ LKKLG DOSE DSYPDGNTTW YRNGKVLHPL 180 
EGAWIIFKK BMDPVTQLYT MTSTLHYKTT KADIOMPFTC SVTYYGPSGQ KTIHSBQA VF 240 
DIYYPTEQVT IQVLPPKN AI KEGDNTTLKC LGNGNPPPEE FLFYLPGQPB GIRSSNTYTL 300 

75 MDVRKNATGD YKCSLIDKKS MIASTAITVH YLDLSLNPSG BVTRQIGDAL FVSCTISASR 360 
NATWWMKDN IRLRSSPSFS SLHYQDAGNY VCBTALQEVB GLKKRESLTL IVEGKPQIKM 420 
TKKTDPSGLS KXnCHVEGF PKPAIQWT1T GSGSV1KOTE ESPYINGRYY SKmSPEEN 480 
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VTLTCTAENQ LERTVNSLNV SAISEPEHDB ADBISDENRE KVNDQAKJJV GIWGLLLAA 540 
LVAGWYWLY MKKSKTASKH VNKDLGNMEB NKKLEENNHK TEA 



Seq ID NO: 137 DNA sequence 

Nucleic Acid Accession #: XM JB0559 

Coding sequence: 1-1 19 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

ATgAcCXJCA GCCACCGGCA CGGGGCGOGC AGCGGCTGCC TGGGCACTAT GGAGGTGAAG 60 
AQCAAOTTTO GAGCTGAATT TCQTOGGTTT TCGCTGO AAA GATCA AAAC C TGOAAAA TTT 120 
GAGGAGTTrT ATGGATTACT ACAACATGTT CATAAGATCC CCAATGTTGA CGTTTTGGTA 180 
GGCTATGCAG ACATCCATGG AGACTtACTA CCTATAAATA ATGATG ATAA TTATCACAAA 240 
GCTGTTTCAA CX3GCCAATCC ACTGCTTAGG ATATTTATAC AAAAGAAQGA AGAAGCAOAC 300 
TACAGTGCCT TTGGTACAGA CACGCTAATA AAG AAGAAGA ATGTTTTAAC CAACGTATTG 360 
CGTCCTOACA ACCATAGAAA AAAGCCACAT ATAGTCATTA GTATGCCCCA AGACTTTAGA 420 
CCTGTGTCTT CTATTATAGA CGTGGATATT CTCCCAG AAA CGCATCGTAG GGTACGTCTT 480 
TACAAATACG GCACGGAGAA ACCCCTAGGA TTCTACATCC GGGATGGCTC CAGTGTCAGG 540 
GTAACACCAC ATGGCTTAGA AAAGGTTOCA GGGATCTTTA TATCCAGGCT TGTCCCAGGA 600 
GGTCTGGCTC AAAGTACAGG ACT ATTAGCT GTTAATGATG AAGTTTTAGA AGTTAATGGC 660 
ATAGAAGTTT CAGGGAA6AG CCTTGATCAA GTAACAGACA TGATG ATTGC AAATAGCCGT 720 
AACCTCATCA TAACAGTGAG ACCGGCAAAC CAGAGGAATA ATGTTGTGAG GAACAGTCGG 780 
ACTTCTGGCA GTTCOGGTCA GTCTACTGAT AACAGCCTTC TTGGCTACCC ACAGCAGATT 840 
GAAGCAAGCT TTGAGCCAGA GGATGAAGAC AGCG AAGAAG ATGACATTAT CATTGAAGAC 900 
AATGGAGTGC CACAGCAGAT TCCAAAAGCT GTTCCTAATA CTGAGAGCCT GGAGTCATTA 960 
ACACAGATAG AGCTAAGCTT TGAGTCTGGA CAGAATGGCT TTATTCCCTC TAATGAAGTG 1020 
AGCTTAGCAG CCATAGCAAG CAGCTCAAAC ACGGAATTTG AAACACATGC TCCAG ATCAA 1080 
AAACTCTTAG AAGAAGATGG AACAATCATA ACATTATGA 

SeqlDNO. 138 Protein sequence: 
Protein Accession Si XPJB30559 

1 11 21 31 41 51 

MhTRSHRHG AG SGCLGTMEVK SKFGAEFRRF SLERSKPGKF HEFYGUjQHV HK1PNVDVLV 60 
GYADTHGDLL PINNDDNYHK A VST ANPLLR IFIOKKBEAD YSAFGTDTU KKKNVLTNVL 120 
RPDNHRKKPH IVISMPQDFR PVSSnDVDI LPETHRRVRL YKYGTEKPLG FYIRDGSSVR 180 
VTPHGLEKVP GIF1SRLVPG GLAQSTGLLA VNDEVLEVNG EEVSGKSLDQ VTDMMIANSR 240 
NLHTVRPAN QRNNWRNSR TSGSSGQSTD NSLLGYPQQI EPSPEPEDED SEEDP IHED 300 
NGVPQQIPKA VPNTESLESL TQIKLSFESG QNGF1PSNEV SLAAIASSSN TEFETHAPDQ 360 
KLLEEDGTHTL 
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It is understood that the examples described above in no way serve to limit the 
true scope of this invention, but rather are presented for illustrative purposes. All 
publications, sequences of accession numbers, and patent applications cited in this 
specification are herein incorporated by reference as if each individual publication or patent 
application were specifically and individually indicated to be incorporated by reference. 
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WHAT IS CLAIMED IS: 



1 LA method of detecting a breast cancer-associated transcript in a cell 

2 from a patient, the method comprising contacting a biological sample from the patient with a 

3 polynucleotide that selectively hybridizes to a sequence at least 80% identical to a sequence 

4 as shown in Tables 1-25. 

1 2. The method of claim 1, wherein the biological sample comprises 

2 isolated nucleic acids. 

1 3 . The method of claim 2, wherein the nucleic acids are mRNA. 

1 4. The method of claim 2, further comprising the step of amplifying 

2 nucleic acids before the step of contacting the biological sample with the polynucleotide. 

1 5. The method of claim 1 , wherein the polynucleotide comprises a 

2 sequence as shown in Tables 1 -25 . 

1 6. The method of claim 1 , wherein the polynucleotide is immobilized on 

2 a solid surface. 

1 7. The method of claim 1 , wherein the patient is undergoing a therapeutic 

2 regimen to treat breast cancer. 

1 8. The method of claim 1 , wherein the patient is suspected of having 

2 breast cancer. 

1 9. An isolated nucleic acid molecule consisting of a polynucleotide 

2 sequence as shown in Tables 1 -25. 

1 10. The nucleic acid molecule of claim 9, which is labeled, 

1 11. An expression vector comprising the nucleic acid of claim 9. 

1 12. A host cell comprising the expression vector of claim 1 1 . 
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1 13. An isolated polypeptide which is encoded by a nucleic acid molecule 

2 having polynucleotide sequence as shown in Tables 1 -25 . 

1 14. An antibody that specifically binds a polypeptide of claim 13. 

1 15. The antibody of claim 14, further conjugated to an effector component 

1 16. The antibody of claim 15, wherein the effector component is a 

2 fluorescent label. 

1 17. The antibody of claim 15, wherein the effector component is a 

2 radioisotope or a cytotoxic chemical. 

1 18. The antibody of claim 15, which is an antibody fragment 

1 1 19. The antibody of claim 15, which is a humanized antibody 

1 20. A method of detecting a breast cancer cell in a biological sample from 

2 a patient, the method comprising contacting the biological sample with an antibody of claim 

3 14. 

1 21. The method of claim 20, wherein the antibody is further conjugated to 

2 an effector component 

1 22. The method of claim 21, wherein rife effector component is a 

2 fluorescent label. 

1 23. A method for identifying a compound that modulates a breast cancer- 

2 associated polypeptide, the method comprising the steps of: 

3 (i) contacting the compound with a breast cancer-associated polypeptide, the 

4 polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence at least 

5 80% identical to a sequence as shown in Tables 1-25; and 

6 (ii) determining the functional effect of the compound upon the polypeptide. 
1 24. A drag screening assay comprising the steps of 
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2 (i) administering a test compound to a mammal having breast cancer or a cell 

3 isolated therefrom; 

4 (ii) comparing the level of gene expression of a polynucleotide that selectively 

5 hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 1-25 in a 

6 treated cell or mammal with the level of gene expression of the polynucleotide in a control 

7 cell or mammal, wherein a test compound that modulates the level of expression of the 

8 polynucleotide is a candidate for the treatment of breast cancer. 



413 



WO 02/059377 PCT/US02/02242 



TTTTCAAGAA CTAACACAGT TATTCCTATA CTGGATTTTA GGTCTCTGAA GAACTGCTGG 2760 
TGTTTAGGAA TAAOAATGTO CATGAAGCCT AAAATACCAA OAAAOCTTAT ACTGAATTTA 2820 
AGCAAAGAAA TAAAGGAGAA AAGAGAAGAA TCTGAG AATT GGGGAGGCAT AGATTCTTAT 2880 
AAAAATCACA AAATTTGTTG TAAATTAGAG GGGAGAAATT TAGAATTAAG TATAAAAAGG 2940 
5 CAG AATTAGT ATAGAGTACA TTCATTAAAC ATTTTTGTCA GGATTATTTC CCGTAAAAAC 3000 
GTAGTGAGCA CTCTCATATA CTAATTAGTG TACATTTAAC nTGTATAAT ACAGAAATCT 3060 
AAATATATTT AATGAATTCA AGCAATATAC ACTTGACCAA GAAATTGGAA TTTCAAAATO 3120 
TTCGTGCGGG TTATATACCA GATGAGTACA GTGAGTAGTT TATGTATCAC CAGACTGGGT 3180 
TATTGCCAAG TTATATATCA CCAAAAGCTG TATGACTGGA TGTTCTGGTT ACCTGGTTTA 3240 
10 CAAAATTATC AG AGTAGTAA AACTTTG ATA TATATGAGGA TATTAAAACT ACACTAAGTA 3300 
TCATTTGATT CGATTCAGAA AGTACTTTGA TATCTCTCAG T GCTTC AGTO CTATCATTGT 3360 
G AGCAATTGT CTTTATATAC GGTACTGTAG CCATACTAGG CCTGTCTGTG GCATTCTCTA 3420 
GATGTTTCTT TTTTACACAA TAAATTCCTT ATATCAGCTT G 

15 Seq ID NO: 30 Protein sequence 

Protein Accession #: NPJQ36451.2 

1 li 21 31 41 51 

20 MAJRKLSVILl LTF ALSVTNP LHELKAAAFP QTTEKISFNW ESQINVDLAI STRQYHLQQL 60 

FYRYGBNNSL SVEGFRKLLQ NIGIDKIKRI HIHHDHDHHS DHEHHSDHHR HSDHEHHSDH 120 

EHHSDHDHHS HIINHAASGKN KRKALCPDHD SDSSGKDFKN SQGKG AHRPB HASGRRNVKD 180 

SVSASEVTCT VYNTVSEGTH FLBTIETPRP GKLFPKDVSS STPPSVTSKS RVSRLAGRKT 240 

NESVSEPRKG FMYSRNTNEN PQECFNASXL LTSHGMGIQV PLNATEFNYL CPAIINQIDA 300 
25 RSOJHTSEK KAEIPPKTYS LQlAWVGGFI AISOSFLSL LGVILVPLMN RVFFKFLLSF 360 

LVALAVGTLS GDAFLHLLPH SHASHHHSHS HEEP AMEMKR GPLFSHLSSQ NIEESAYFDS 420 

TWKGLTALGG LYFMFLVEHV LTUKQFKDK KKKNQKKPEN DDDVEDCKQL SKYESQLSTN 480 

EEK.VDTDDRT EGYLRADSQE PSHFDSQQPA VLEEEEVM1A HAHPQEVYNE YVPRGOCNKC 540 

HSHFHDTLGQ SDDLIHHHHD YHHILHHHHH QNHHPHSHSQ RYSREELKDA GVATLAWMVI 600 
30 MGDGLHNFSD GLAJGAAFTE GLSSGLSTSV AVFCHELPHE LGDFAVUXA GMTVKQAVLY 660 

NALSAMLAYL GMATGIFIGH YABNVSMWIF ALTAGLFMYV ALVDMVPEML HNDASDHGCS 720 

RWGYFFLQNA GMLLGFGIML LISIFEHKIV FRINF 

Seq ID NO: 31 DNA sequence 
35 Nucleic Add Accession #: NMJD02184J 

Coding sequence; 256-30 1 ffundcrimed sequences coiiespond to start and stop codons) 

1 11 21 31 41 51 

40 GAGCAGCCAA AAGGCXXXiOG GAGTCGCGCT GGGCCGCCCC GGCGCAGCTO AACCGGGGGC 60 

CGCGCCTGCC AGGCCGACGG GTCTGGCCCA GCCTGGCGCC AAGGGGTTCG TGCGCTGTGG 120 

AGACGCGGAG GGTCGAGGCG GCGCGGCCTG AGTG AAACCC AATGGAAAAA GC ATGAC ATT 180 

TAGAAGTAGA AG ACTTAGCT TCAAATCCCT ACTCCTTCAC TTACTAATTT TGTGATTTGG 240 

AAATATCCGC GCAAGATGTT GACGTTGCAG ACTTGGGTAG TGCAAGCCTT GTrTATTTTC 300 
45 CTCACX^CTG AATCTACAGG TGAACTTCTA GATCCATGTG GTTATATCAG TCCTGAATCT 360 

CCAGTTGTAC AACTTCATTC TAATTTCACT GCAGTTTGTG TGCTAAAGGA AAAATGTATG 420 

GATTATTTTC ATGTAAATGC TAATTACATT GTCTGGAAAA CAAACCATTT TACTATTCCT 480 

AAGGAGCAAT ATACTATCAT AAACAGAACA GCATCCAGTG TCAOCTTTAC AGATATAGCT 540 

TCATTAAATA TTCAGCTCAC TTGCAACATT CTTACATTCG GACAGCTTGA ACAGAATXJTT 600 
50 TATGG AATCA CAATAATTTC AGGCTTGCCT CCAGAAAAAC CTAAAAATTT GAGTTGCATT 660 

GTGAAOGAGG GG AAGAAAAT GAGGTOTGAG TGGGATGGTG G AAGGGAAAC ACACTTGGAG 720 

ACAAACTTCA CTTTAAAATC TGAATGGGCA ACACACAAGT TTG CTGA TTG CAAAGCAAAA 780 

CGTGACACXX: CCACCTCATG CACTGTTGAT TATTCTACTG TGTATTTTGT CAAG ATTGAA 840 

GTCTGGGTAG AAGCAGAGAA TGCCCTTGGG AAGGTTACAT CAGATCATAT CAATTTTGAT 900 
55 CCTGTATATA AAGTGAAGCC CAATCCGCCA CATAATTTAT CAGTGATCAA CTCAGAGGAA 960 

CTGTCTAGTA TCTTAAAATT G ACATGGACC AACCCAAGTA TTAAGAGTGT TATAATACTA 1020 

AAATATAACA TTCAATATAG G ACCAAAGAT GCCTCAACTT GGAGCCAG AT TCCTCCTGAA 1080 

GACACAGCAT CCACCCGATC TTCATTCACT GTCCAAGACC TTAAACCTTT TACAGAATAT 1140 

GTGTTTAGG A TTCGCTGTAT GAAGGAAGAT GGTAAGGGAT ACTGGA GTGA CTGGAGTGAA 1200 
60 GAAGCAAGTG GGATCACCTA TG AAGATAGA CCATCTAAAG CACCAAGTTT CTGGTATAAA 1260 

ATAG ATCCAT CCCATACTCA AGGCTACAGA ACTGTACAAC TCGTGTGGAA GACATTGCCT 1320 

CCTTTTGAAG CCAATOGAAA AATCTTGGAT TATG AAGTGA CICTCACAAO ATOGAAATCA 1380 

CATTTACAAA ATTACACAGT TAATGCCACA AAACTGACAG TAAATCTCAC AAATGATCGC 1440 

TATCTAGCAA CXXTAACAGT AAGAAATCTT GTTGGCAAAT CAGATGCAGC TGTTTTAACT 1500 
65 ATCCCTGCCT GTGAC7TTCA AGCTACTCAC CCTGTAATGG ATCTTAAAGC ATTCCCCAAA 1560 

GATAACATGC TTTGGGTGGA ATGGACTACT CCAAGGGAAT CTGTAAAGAA ATATATACTT 1620 

GAGTGGTGTG TGTTATCAGA TAAAGCACCC TGTATCACAG ACTGGCAACA AGAAGATGGT 1680 

ACOGTGCATC GCAOCTATTT AAGAGOGAAC TTAGCAGAGA GCAAATGCTA TTTGATAACA 1740 

GTTACTCCAG TATATGCTG A TGGACCAGGA AGCCCTGAAT CCATAAAGGC ATACCTTAAA 1800 
70 CAAGCTCCAC CTTCCAAAGG ACCTACTGTT CGGACAAAAA AAGTAGOGAA AAACGAAGCT 1860 

GTCTTAGAOT GGGACCAACT TCCTGTTOAT GTTCAGAATG GATTTATCAG AAATTATACT 1920 f 

ATATTTTATA G AACCATCAT TGGAAATGAA ACTGCTGTGA ATGTGGATTC TTCCCACACA 1980 

GAATATACAT TGTCCrCTTT GACTAGTGAC ACATTGTACA TGGTACGAAT GGCAGCATAC 2040 

ACAGATGAAG GTGGGAAGGA TGGTCCAGAA TTCACTTTTA CTACCCCAAA OTTTGCTCAA 2100 
75 GGAGAAATTG AAGCCATAGT CGTGCCTGTT TGCTTAGCAT TCCTATTGAC AACTCTTCTG 2160 

GGACTGCTGT TCTGCTTTAA TAAGCGAGAC CTAATTAAAA AACACATCTG GCCTAATGTT 2220 

CCAG ATCCTT CAAAGAGTCA TATTGCCCAG TGGTCACCTC ACACTCCTCC AAGGCACAAT 2280 
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TTTAATTGAA AAG ATCAAAT GTATTCAGAT GGCAATTTCA CTOATGTAAG TGTTGTGQAA 2340 
ATAGAAGCAA ATOACAAAAA OCCTTTTCCA GAAGATCTOA AATCATTGGA CCTGTTCAAA 2400 
AAGGAAAAAA TTAATACTGA AGGACACAGC AGTGGTATTG GGGGGTCTTC ATGCATGTCA 2460 
TCTTCTAGGC CAAGCATTTC TAGCAGTGAT GAAAATGAAT CTTCACAAAA CACTTCGAGC 2520 
ACTGTCCAGT ATTCTACCGT OGTACACAGT GGCTACAG AC ACCAAGTTCC GTCAGTCCAA 2580 
GTCTTCTCAA GATCCGAGTC TACCCAGCCC TTGTTAGATT CAGAGG AGOG GCCAGAAGAT 2640 
CTACAATTAG TAGATCATGT AGATGGCGOT GATGGTATTT TGCCCAGGCA ACAGTACTTC 2700 
AAACAGAACT GCAGTCAGCA TGAATCCAGT CCAGATATTT CACATTTTGA AAGGTCAAAG 2760 
CAAGTTTCAT CAGTCAATGA GGAAGATTTT GTTAGACTTA AACAGCAGAT TTCAG ATCAT 2820 
ATTTCACAAT CCTGTGGATC TOGGCAAATG AAAATGTTTC AGGAAGTTTC TGCAGCAGAT 2880 
GC1 1 1 IGGTC CAGGTACTGA GGGACAAGTA GAAAGATTTG AAACAGTTGG CATGGAGGCT 2940 
GCGACTG ATG AAGGCATGCC TAAAAGTTAC TTACCACAGA CTGTACGGCA AGGOGGCTAC 3000 
ATGCCTCA GT OAA GOACTAG TAGTTCCTGC TACAACTTCA GCAGTACCTA TAAAGTAAAG 3060 
CTAAAATGAT TTTATCTGTG AATTC 

Seo ID NO: 32 Protein aeouence: 
Protein Accession ft NP0O2I75.1 

I 

1 11 21 31 41 51 
I i I I I I 

MLTLOTWWQ ALFIFLTTES TGELLDPCGY ISPESPWQL HSNFTAVCVL KEKCMDYFHV 60 
NANYTVWKTN HFTIPKEQYT IINRTASSVT FTDIASLNIQ LTCNILTFGQ LEQNVYGITI 120 
ISGLPPEKPK NLSCTVNEGK KMRCEWDGGR ETHLETNFTL KSEWATHKFA DCXAKRDTPT 180 
SCTVDYSTVY FVNIEVWVEA HNALGK VTSD HINFDPVYKV KPNPPHNLSV INSEELSSZL 240 
KLTWTNPSK SVHLKYNIQ YRTKDASTWS QIPPEDTAST RSSFTVQDLK PFTEYVFRIR 300 
CMKEDGKGYW SDWSEEASGI TYEDRPSKAP SFWYKIDPSH TQGYRTVQLV WKTLPPFEAN 360 
GKILDYEVTL TRWKSHLQNY TVNATKLTVN LTNDRYLATL TVRNLVGKSD AAVLTTPACD 420 
FQATHPVMDL KAFFKDHMLW VEWTTPRESV KKYILEWCVL SDKAPOTDW QQEDGTVHRT 480 
YIAGNLAESJC CYLirVTPVY ADGPGSPBSI KAYLKQAPPS KGPTVRTKKV GXNEAVLEWD 540 
QLPVDVQNGF KNYTIFYRT IIGNETAVNV DSSHTEYTLS SLTSDTLYMV RMAAYTDEGG 600 
KDGPEFTFTT PKFAQGEffiA IWPVCLAFL LTTLLGVLFC FNKRDUKKH IWPNVPDPSK 660 
SHIAQWSPHT PPRHNFNSKD QMYSDGNFTD VSWEtEAND KKPFPEDLKS LDLFKKEKIN 720 
TBGHSSGIGG SSCMSSSRPS ISSSDEKBSS ONTSSTVQYS TWHSGYRHQ VPSVQVFSRS 780 
ESTQPIXDSB ERPEDLQLVD HVDGGDGILP RQQYFKQNCS QHESS>DISI FERSKQVSSV 840 
NEEDFVRLKQ Q1SDHISQSC GSGOMKMFQE VSAADAFGPG TEGQVERFET VGMEAATDEG 900 
MPKSYLPQTVRQGGYMPQ 

Scq ID NO: 33 DNA sequence 

Nndek Add Accession ft NM_01 8255 J 

Coding sequence: 11-2491 (underiined sequences concspond to start and stop codons) 
1 II 21 31 41 51 

11)111 

AGTTGGCGAC ATGGTGGCAC CCGTGCTGGA GACTTCTCAC GTGTTTTGCT GCCCAAACCG 60 
GGTGCGGGG A GTCCTGAACT GGAGCTCTGG GCCCAGAGGA CTTCTGGCCT TTGGCACGTC 120 
CTGCTCCGTC GTCCTCTATG ACCCCCTGAA AAGGGTTGTT GTTACCAACT TGAATGGTCA 180 
CACCGCCCGA GTCAATTGCA TACAGTGOAT TTGTAAACAG GATGGCTCCC CTTCTACTG A '240 
ATTAGTTTCT GGAGGATCTG ATAATCAAGT GATTCACTGG G AAATAGAGG ATAATCAGCT 300 
TITAAAAGCA GTGCATCTTC AAGGCCATGA AGOACCTGTT TATGCGGTGC ATGCTGTTTA 360 
CCAGAGGAGG ACATCAGATC CTGCATTATG TACACTGATC GTTTCTGCAG CTGCAGATTC 420 
TGCTGTTCGA CTCTGGTCTA AAAAGGGTCC AGAAGTAATG TGCCTTCAGA CTTTAAACTT 480 
TGOAAATOGA 1 i imCl KUO CTCTCTOCTT A lCITi I ' l 1U CCAAATACTG ATGTACCAAT 540 
ATTAGCATGT GGCAATGATG ATTGCAGAAT TCACATATTT GCTCAACAAA ATGATCAGTT 600 
TCAG AAAGTG CTTTCTCTCT GTGGACATGA GGATTGG ATT AG AGGAGTGG AATGGGCAGC 660 
CTTTGGTAGA GATCTTTTCC TAGCAAGCTO TTCACAAGAT TGCCTGATAA GAATATGGAA 720 
GCTGTATATA AAGTCAACAT CTTTAGAAAC TCAGGATGAC GATAACATAA GACTGAAAGA 780 
AAATACTTTT ACCATAGAAA ATOAAAGTGT TAAAATAGCA TTTGCTGTTA CTCTGGAGAC 840 
AGTGCTAGCC GGTCATGAAA ACTGGGTAAA TGCAGTTCAC TGGCAACCTG TGTTTTACAA 900 
AGATGGTGTC CTACAGCAGC CAGTOAGATT ATTATCTGCT TCCATGGATA AAACCATGAT 960 
TCTCTGGGCT CCAGATGAAG AGTCAGGAGT TTGGCTAGAA CAGGTTCGAG TAGGTGAAGT 1020 
AGGTGGG AAT ACTTTGGGAT TTTATGATTG CCAGTTCAAT GAAGATGGCT CCATGATCAT 1080 
TGCTCATGCT TTCCACGGAG COTTGCACCT TTGGAAACAG AATACAGTTA ACCCAAGAG A 1140 
GTGGACTCCA GAGATTGTCA TTTCAGGACA CTTTGATGGT GTCCAAGACC TAGTCTGGGA 1200 
TCCAGAAGGA GAATTTATTA TCACTGTTGG TACTGATCAG ACAACTAGAC TTTTTGCTCC 1260 
ATGGAAGAGA AAAGACCAAT CACAGGTGAC TTGGCATGAA ATTGCAAGGC CTCAOATACA 1320 
TGOGTATGAC CTO AA ATGTT TGGCAATQAT TAATCOOTTT CAGTTTOTAT CTGGAGCAGA 1380 
TGAAAAAGTT CTTCGGGTTT TTTCTGCACC TCGGAATTTT GTGGAAAATT TTTGTGCCAT 1440 
TACAGOACAA TCACTGAATC ATGTGCTCTC TAATCAAGAT AGTGATCTTC CAGAAGG AGC 1500 
CACTOTCCCT GCATTGGOAT TATCAAATAA AGCTGTCTTT CAGGGAGATA TAGCTTCTCA 1560 
GCCrTCTGAT GAAGAGGAGC TGTTAACTAG TACTGGTTTT GAGTATCAGC AGGTGGCCTT 1620 
TCAGCCCTCC ATACTTACTG AGCCTCCCAC TGAGGATCAT CTTCTGCAG A ATACTTTGTG 1680 
GCCTOAAGTT CA AAAACTAT ATGQGCACGG TTATGAAATA ' It l' lti I XJll 'A CTTQTAACAG 1740 
TTCAAAGACT CIUL'I'I OCCT CAGCTTGTAA GGCAGCTAAG AAAGAGCATG CAGCTATCAT 1800 
TCTTTGGAAC ACTACATCTT GGAAACAGGT GCAGAATTTA GTTTTCCACA GTTTGACAGT 1860 
CACGCAGATG GCCTTCTCAC CTAATGAGAA OTTCTTACTA GCTGTTTCCA GAGATCGAAC 1920 
CTGGTCATTO TGGAAAAAGC AGGATACAAT CTCACCTGAG TTCGAGCCAG TTTTTAGTCT 1980 
TTTTGCCTTC ACCAACAAAA TTACTTCTGT GCACAGTAGA ATTATTTGGT CTTGTGATTG 2040 
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OAOTCCTGAC AGCAAOTATT TCTTCACTGG OAGTCGAOAC AAAAAOGTGG TTGTCTGGGG 2100 
TGAGTGCGAC TCCACTGATG ACTGTATTGA GCACAACATT GGCCCCTGCT CCTCAGTCCT 2160 
GGACGTGGGT GGGGCTGTG A CAGCTGTCAG CGTCTGCCCA GTGCTCCACC CTTCTCAACG 2220 
ATACGTGGTT GCAGTAGG AT TGGAGTGTGG AAAGATTTGC TTATATACCT GG AAAAAGAC 2280 
5 TGATCAAGTT CCAGAAATAA ATGACTGGAC CCACTGTGTA GAAACAAGTC AAAGCCAAAG 2340 
TCATACACTG GCTATCAG AA AATTATGCTG GAAGAATTGC AGTGGAAAAA CTG AACAGAA 2400 
GGAAGCAGAA GGTGCTGAGT GGTTACACTT TGCAAGCTGT GGTGAAGATC ACACTGTGAA 2460 
GATACACAGA GTCAATAAAT GTGCACT GTA AT GG 

10 S^roNO:34 Protrfnseqiience; 

Protein Accession #: * NP060725.1 

1 II 21 31 41 51 

15 MVAPVLETSH VFCCPNRVRG VLNWSSGPRG LLAPGTSCSV VLYDPLKRW VTNLNGHTAR 60 
VHCIQWKXQ DGSPSTELVS GGSDNQVIHW HEDNQLLKA VHLQGHEGPV Y AVHAVYQRR 120 
TSDPALCTU VSAAADSAVR LWSKKGPEVM CLQTLNFGNG FALALCLSFL PNTDVPELAC 180 
GNDDCRDflF AQQNDQFQKV LSLCGHEDW1 RGVEWAAFGR DLFLASCSQD CURIWKLY1 240 
KSTSLETQDD DNIRLKHNTF TTENESVK1A FAVTLETVLA GHENWVNAVH WQPVFYKDGV 300 

20 LQQPVRLLSA SMDKTMILWA PDEESGVWLE QVRVQEVGGN TLGFYDCQFN EDGSMHAHA 360 
FHGALHLWKQ NTVNPREWTP EIVISGHFDG VQDLVWDPEG EFHTVOTDQ TTRLFAPWKR 420 
KDQSQVTWHB IARPQIHGYD LKCLAMINRF QFVSG ADEKV LRVFSAPRNF VENFCATTGQ 480 
SLNHVLCNQD SDLPEGATVP ALGLSNKAVF QGD1ASQPSD EEELLTSTGF EYQQVAFQPS 540 
nTEPFTEDH LLQNTLWPEV QKLYGHGYH FCVTCNSSKT LLASACKAAK KBHAAHLWN 600 

25 TTSWKQVQNL VPTCLTVTQM AFSPNEKFLL AVSRDRTWSL WKKQDTISPE FEPVFSLFAF 660 
TNKTTSVHSR 1IWSCPWSPD SKYFFTGSKP KKVWWQECD STDDCIBHNI GP CSSVL DVG 720 
G AVTAVSVCP VLHPSQRYW AVGLEOGKIC LYTWKKTDQV PBINDWTBCV ETSQSQSHTL 780 
AIRKLCWKhCSGICrEOKEABGAEWLHFASC GEDHTVKIHR VNKCAL 

30 Seq IDNtt 35 DNA sequence 

NnddcAcidAcccsswoi: NM_022131 

Co£ng wnpmmr. 11-2878 (undolined sequences cm respond to start and stop cottons) 

1 11 21 31 41 51 

35 i i i i i i 

TGCTGCGAGG ATGCTGCCTG GGCGGCTOTO CTGOGTGCCQ CTCCTGCTGO COCTGQGCGT 60 
GGGGAGCGGC AGCGGCGGTG GCGGGG ACAG CCGGCAGCGC CGCCTCCTCG CGGCTAAAGT 120 
CAATAAGCAC AAGCCATGGA TCGAGACTTC ATATCATGGA GTCATAACTG AGAACAATGA 180 
CACAGTCATT TTGGACCCAC CACTGGTAGC OCTGGATAAA GATGCAOXK3 TTCCTTTTGC 240 

40 AGGGGAAATC TQTGCGTTCA AGATCCATOO CCAGQAGCTG COCTTTOAGG CTOTGOTGCT 300 
CAACAAGACA TCAGGAGAGG GCCGGCTCCG TGCCAAGAGC CCCATTGACT GTGAGTTGCA 360 
GAAGGAGTAC ACATTCATCA TCCAGGCCTA TOACTGTGGT OCTGGGCOCC ACGAGACAOC 420 
CTGGAAAAAG TCACACAAGG COGTGGTCCA TATACAGGTG AAGGATGTCA ACGAGTTTGC 480 
TCCCACCTTC AAAGAGCCAG CCTACAAGGC TGTTGTGACG GAGGGCAAGA TCTATGACAG 540 

45 CATTCTGCAG GTGGAGGCCA TTGACGAGGA CTGCTCCCCA CAGTACAGCC AGATCTGCAA 600 
CTATGAAATC GTCACCACAG ATGTGCCTTT TGCCATCGAC AGAAATGGCA ACATCAGGAA 660 
CACTOAOAAO CTOAGCTATG ACAAACAACA CCAOTATGAO ATCCTOOTOA OCGCCTACGA 720 
CTGTGGACAG AAGGCCGCTG CTCAGGACAC CCTGGTGCAG GTGGATGTGA AGCCAGTTTG 780 
CAAGCCTGGC TGGCAAG ACT GGACCAAGAG GATTGAGTAC CAGCCTGGCT COGGGAGCAT 840 

50 GOJUU'lUllt: CCCAGCATOC ACCTOaAOAC OTGOGATGGA OOOtfrUlClT CCCTO CAGAT 900 
CGTCACAGAG CTGCAGACTA ATTACATTGG GAAGGGTTGT GACCGGGAGA CCTACTCTGA 960 
GAAATCCCTT CAGAAGTTAT GTGGAGCCTC CTCTGGCATC ATTGACCTCT TGCCA TOCCC 1020 
TAGCGCTGCC ACCAACTGGA CTGCAGGACT GCTGGTGGAC AGCAGTGAGA TGATCTTCAA 1080 
GTTTGAOGGC AGGCAGGGTG CCAAAATCCC CGATOGGATT GTGOCCAAGA ACCTOACCGA 1140 

55 TCAGTTCACX: ATCACCATGT GGATGAAACA CGGCOCCAGC CCTGGTGTGA GAGCCGAGAA 1200 
GGAAACCATC CTCTGCAACT CAGACAAAAC CGAAATGAAC CGGCATCACT ATGCCCTGTA 1260 
TGTGCACAAC TGCOGCCTCG TCTTTCTCTT GCGGAAGGAC TTCGACCAGG CTG ACACC7T 1320 
TCGCCCCGCG GAGTTCCACT GGAAGCTGG A TCAGATTTGT GACAAAGAGT GGCACTACTA 1380 
TGTCATCAAT GTGGAGTTTC CTGTGGTAAC CTTATACATG GATGGAGCAA CATATGAACC 1440 

60 ATACCTGGTG ACCAACGACT OGCCCATTCA TCCATCTCAC ATAGCCATOC A ACTCA CAOT 1500 
COGCGCTTGT TGGCAAGG AG GAGAAGTCAC CAAACCACAG TTTGCTCAGT TCTTTCATGG 1560 
AAGCCTGGCCAGTCTCACC^TCCGCCCn^ 1620 
CCTGCAGGCC TGCAAGG AAG GGCTGGACAT TAATTCCTTG GAAAGCCTTG GCCAAGGAAT 1680 
AAAGT ATCAC TTCAACCCCT CGCAGTOCAT CCTGGTGATG GAAGGTGACG ACATTGGGAA 1740 

65 CATTAACCGT GCTCTCCAGA AAGTCTCCTA CATCAACTCC AGGCAGTTCC CAACGGCGGG 1800 
TGTGCGGOGC CTCAAAGTAT CCTCCAAAGT CCAGTGCTTT GGGGAAG ACG TATGCATCAG 1860 
TATCCCTGAG GTAGATGCCT ATGTGATGGT CCTCCAGGCC ATCGAGCCCC GGATCACCCT 1920 
CCGGGGCACA GACCACTTCT GGAGACCTGC TGCCCAGTTT GAAAGTGCCA GGGGAGTQAC 1980 
gCTCrrCCCT GATATCAAG A TTGTG AGCAC CTTCGCCAAA ACCGAAGCCC CCGGGGACGT 2040 

70 G AAAACCACA G ACCCCAAAT CAGAAGTCTT AGAOG AAATG CTTCATAACT TAGATTTCTO 2100 
TOACATTTTG GTGATCGGAG GGGACTTGGA CCCAAGGCAG GAGTGCTTGG AGCTCAACCA 2160 
CAGTGAGCTC CACCAACGAC ACCTGGATGC CACTAATTCT ACTGCAGOCr ACTCCATCTA 2220 
CGGTGTGGGC TCCATG AGCC GCTATGAGCA OGTOCTACAT CACATCCGCT ACCGCAACTG 2280 
GCGTOCGGCT TCCCTTGAGG CCCGGCGTTT CCGG ATTAAG TGCTCAGAAC TCAATGGGCG 2340 

75 CTACACTAGC AATGAGTTCA ACTTGGAGGT CAGCATCCTT CATG AAGACC AAGTCTCAGA 2400 
TAAGGAGCAT GTCAATCATC TGATTGTGCA GCCTCCCTTC CTCCAGTCTG TCCATCATCC 2460 
TGAGTCCCGG AGTAGCATCC AGCACAGTTC AGTGGTCCCA AGCATTGCCA CAGTGGTCAT 2520 
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CATCATCTCC GTOTGCATGC TTGTGTTTGT CGTGGCCATG GGTGTGTACC QGGTCCGGAT 2580 
CGCCCACCAG CACTTCATCC AGO AGACTGA GGCTGCCAAO GAATCTG AGA TGGACTGGG A 2640 
CX3ATTCTGCG CTGACTATCA CAGTCAAOCC CATOGAGAAA CATOAAGGAC CAGGGCATGG 2700 
GOAAGATG AG ACTGAGGGAG AAGAGGAGGA AGAAGCCGAG GAAG AAATGA GCTCCAGCAG 2760 
TGGCTCTOAC GACAGOGAAG AGGAGGAGGA GGAGGAAGGG ATGGGCAGAG GCAGACATGG 2820 
GCAGAATGG A GCCAGGCAAG CCCAGCTGGA GTGGGATGAC TCCACCCTCC CCTACTAOTG 2880 
COCAGGGGTC TGCTGCCTGG CCCACATGTC CCTTTTGTAA ACCCTGACCC AGTGTATOCC 2940 
CATGTCTATC ATAOCTCACC TCTGATGTCT GTGACATGTC TGGGAAGGOC 1TCTOCAGCT 3000 
TCCTGGAGCC CACCCTTTAA GCCTTGGGCA CTCCCTGTGT TTCATCCATG GGGAAGTTCC 3060 
AAGAAGCCCA GCATGGCCAT CAGTGAGGAC TTCAGGGTAG ACTTTGTCCT OTAGOCT0CA 3120 
CnCTGCCCT AAGTTCCCCA GCATCCTGAC TACCrOTCTa CAGAGTTTGC ClllUlllH 3180 
TCCTGCAGGG AAGAAGGCCC ACCTTTGTGT CACTCACCTC CCCAGGCTCA GAGTCCCCAA 3240 
GGOCCTGGGG TTCCAACTCA CTGTGCGTCT CCTCCACACA GACCAGTAGG TTCTCCTATG 3300 
CTGACTCCAG GTTGCTTCAT ACAAGG AGGG TOGTTGAACr TCACACACGT AAGGTCTTAG 3360 
TGCTTAACAG TTTAAAGGAA AGTOCTTGTT GAGGCAGAAC TAAGTTTACA GGGA AACG TA 3420 
CACACATTCT CTCTCTCTCT CTCrUICI G r CTATCTAGTT CCOCAGCTrG GAGAGCCTTT 3480 

cxxcrrocrr ctttctgagg ccatataagc ttataagaaa agtcccaaac caagaatagg 3540 

TCCTTGGOCA CAAGCAGGGT CTGATCCCCC ATCAGAGCTA TCTGAGCCTG CCTGTCTGGG 3600 
CACCTGCTGC AACCATGCAG CTACCCTGCC AGGGGCACTC AGCAAACAGA ACCACAGGGC 3660 
CCAGGAGGCA TICCACACAG GCACTGCCCC AGGACAACAC AACAAGGACA GTCACAACAA 3720 
GGACAACAAG GACACAACAC AACACACAAC AAGGACAGTC ACAACAAGCC TAGAGCCAGA 3780 
AAOCAO ATGO AAATGCTAAT GAGGTCAAAC GTAGGCTTCA TGG TGGG TGG AGTCGGGGTG 3840 
GCTGGGCTCC CCCAGGACAG AGGGGACCCT GAGGTTGGCA AGGCICTCAC CACICAGOCT 3900 
TATGGTCCCT TATCTCCTAT CTTCCCTCTT GAGAAAATAC ACXJCTTTCTG CATGTATTAG 3960 
AAACGCACGA GCTCCACCAA GTCTACAATG AAAGTTTGAA ATTTAACTGC AAGGAATTAG 4020 
AAGCATATTT GCAATCATTG CAGCTTCTTC TTTCTTCTGC TCATAAAAGG AGGAACACTT 4080 
TAGATAGAGG GCAAATATAT CTGAAAAOCT AATTTCTTTC TTTTTTTGAT AAGG AAATCT 4140 
TTTCCATCTC CATCCTAACA TGCACAACCT GTGAAGAG AA TTGTTTCTAT AGTAACTGGT 4200 
CTGTGATCTT TTGTCGCCAA GAGAATAGCA GGCAAGAATT AGGGCCTTGA CAGAATTTCC 4260 
ACGAAOCTCT GAGAACA1U1 TITUTI I LXiAA TGTCTGATTC C1CI 1 1G1C A TCAATGTGTA 4320 
TGCTCTGTCC OCATOCITCA CTCCTCCTCA AGCTCACACC AATltKiTi'IG GCACAGGCAC 4380 
AGAGCTGGTC CCTAGTT AAG TGGCATTTAT GTTAAAAAAA AATAGTTCAG AATCTCAGCC 4440 
UriCl HO T GTCATCAAAA CAGCTTAAGA AGGGGACTAC TGCCAATGTC CTCTAGTCTG 4500 
ACCTOCACCC AGGGAGGACC CATGGCAGGT CTTTTCAACT TTCTGATTC A TGAG AACAAC 4560 
CTTOTGAAGC TTTTCCCACC TCCTAAAOTG TTTTCTGCAT ClUHUL UC CTTTGGACCT 4620 
CACAACAAAT OCTGTGAAGT AACTGAGACA TCTOTTGTTA GATACATTTT TGTGATGAGT 4680 
AAACTGAGGC TTCG 

Seq ID NO: 36 Protein sequence: 
Protein Accession ft NP_07I414J 

1 11 21 31 41 51 

IoJKjRLCWVP LLLALGVGSG SGGGGDSRQR RLLAAKVNKH KPWIETSYHG VTTENNOTV1 60 
LDPPLVAUDK DAPVPFAGEI CAFK3HGQEL PFEAWLNKT SGBGRLRAKS PIDCELCjKEY 120 
TFBQAYDCG AGPHETAWKK SHKAWHIOV KDVNEFAPTF KEPAYKAWT EGKIYDSILQ 180 
VHAIDEDCSP OYSQICNYH VTTDVPFAID RNGNIRNTEK LSYDKQBQYE ILVTA YDCGQ 240 
KPAAQDTLVQ VDVKPVCKPG WQDWTKRIEY QPGSGSIPir PSIHLETCDQ AVSSLQIVTE 300 
LQTNYIGKGC DRETYSHKSL QKLCGASSGI IDLLPSPSAA TNWTAG1XVD SSEMIFKFDG 360 
RQGAKIFDG1 VPKNLTDQPT ITMWMKHGPS PGVRAEKETI LCNSDKTEMN RHHY ALYVHN 420 
CRLVFLLRKD FDQADTFRPA EFHWKLDQIC DXEWHYYVIN VEFFWTLYM DGATYEPYLV 480 
TNDWPIHPSH IAMQLTVGAC WQGGEVTKPQ FAQFFHGSLA SLTIRPGKMB SQKVISCLQA 540 
CKEGLDINSL BSLGQGDCYH FNPSQSILVM EGDWGNINR AIjQKVSYINS ROFPTAGVRR 600 
LKVSSKVQCP GEDVOSIPE VDAYVMVLQA EPRITLROT DHFWRPAAQF ESAROVTLFP 660 
DKIVSTFAK TEAPGDVKTT DPRSEVLEEM LHNL0FCDIL V1GGDLDFRQ BCLELNHSEL 720 
HQRHLDATNS TAGYSIYG VG SMSRYEQVLH HIRYRNWRPA SLEARRFRDC CSEJLNGRYTS 780 
NEFNLEVSEL HEDQVSDKBH VNHLIVQPPF LQSVHHPESR SSIQHSSWP SIATWIHS 840 
VCMLVFW AM GVYRVRIAHQ HFIQBTEAAK ESEMDWDDSA LTTTVNPMEK. HEGPGHGEDB 900 
TEGEEEEEAE EEMSSSSGSD DSEEEEBEEG MGRGRHGQNG ARQAQLEWDD STLPY 

Seq ID NO: 37 DMA sequence 

Nucleic Add Accessbo #: none found 

Coding sequence: 143-874 (anderimed sequences couespond to start and stop codous) 
1 11 21 31 41 51 

GGGAGGGAGA GAGGCGCGCG GGTGAAAGGC GCATTGATGC AGCCTGCGGC GGCCTOGGAG 60 
GGCGGCGGAG CCAGACGCTG ACCACGTTCClCIUjrCGaT CTCCTCOGCC TOCAGCTCCG 120 
CGCTOCCCGO CAGCCGGGAO CCATgCGACC CCAGGGCCCC GCCGCCICC C CGCAG OGGCT 180 
CCGCGGCCTC ClliCTGCTCC TGCTOCTGCA GCTGCCCGCO CCGTCGAGCG CCTCTGAGAT 240 
COCCAAGGGG AAGCAAAAGG CGCAGCTCCG GCAGAGGGAG GTGGTOGACC TGTATAATCG 300 
AATGTGCTTA CAAGGGCCAG CAGGAGTGCC TGGTOGAGAC GGG AGCCCTG GGGCCAATGG 360 
CATTOCGOOT ACACCTGGQA TCCCAGGTCG GGATGOATTC AAAGGAGAAA AGGGGGAATO 420 
TCTGAGGGAA AGCTTTGAGQ AGTCCTGG AC ACCCAACTAC AAGCAGTGTT CATQG AGTTC 480 
ATTG AATTAT GGCATAGATC TTGGGAAAAT TGCGGAGTGT ACATTTACAA AGATGCGTTC 540 
AAATAGTGCT CTAAGAGTTT TGTTCAGTGG CTCACTTCGG CTAAAATGCA GAAATGCATG 600 
CTGTCAGCGT TGGTATTTCA CATTCAATGG AGCTGAATGT TCAGGACCTC TTCCCATTGA 660 
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CAACCTTAAA AAAAAAAAAA AAAA 

SeqlDNOe 38 Protein seonence: 
^ PmtciD Accession #: nonefbund 

t 11 21 31 41 51 

MRPOGPAASP QRLRGLJLLLL LLQLPAPSSA SEIPKGKQICA QLRQRBWDL YNGMCLQOPA 60 
GVPGRDGSPG ANGIPGTPGI PGRDGFKGEK GECLRESFEE SWTPNYKQCS WSSLNYGH)L 120 
20 GK1AECTFTK MRSNSAIJfcVL FSGSLRLKCR NACCQRWYFT FNG AECSGPL PIBAIIYLDQ 180 
GSPEMNSITN IHRTSSVBGL CEGIOAGLVD VAIWVGTCSD YPKGDASTGW NSVSRDIEB 240 
LPK 

_ Seq JDD NO: 39 DMA sequence 

25 Nudeic Add Accessfcm #: NM 000949 

Coding sequence: 285-21 53(underiined sequences correspond to start and stop cottons) 

1 11 21 31 41 51 

30 GGAGGCTGAA ATCCCCAGAC GCCGGTTTTC TGGGCTQGGC TTTCTGCTTA CTCACTCCTT 60 
CTCCCTCnT CTGGATTTTA CCQACCGTTC GCGAAACAGC TTTCCACACA ATGGAGCTTC 120 
ATOTCCTCOT GCAOGAAGTA CTCATCCACT GATGTGGCAG ACTTTGCTCC CTGACAAAAC 180 
TAAAG AACTC TOCTATTCAT GGAGGCGAAC ACTGAGGATG CTTTCCACAT G AACCCTGAA 240 
GTGAACTTCT GATACATTTC CTGCAGCAAG AGAAGGCAGC CAACATOAAG GAAAATGTGG 300 

35 CATCTGCAAC CGTnTCACT CTGCTACTTT TTCTCAACAC CTGCCTTCTG AATGGACAGT 360 
TACCTCCTGG AAAACCTG AG ATCTTTAAAT GTCGTTCTCC CAATAAGGAA ACATTCACCT 420 
GCTGGTGGAG GCCTGGGACA GATGG AGGAC TTCCTACCAA TTATTCACTG ACTTACCACA 480 
GGGAAGGAGA GACACTCATG CATGAATGTC CAGACTACAT AACCGGTGGC CCCAACTCCT 540 
GCCACTTTOG CAAGCAGTAC ACCTCCATGT GGAGGACATA CATCATGATG GTCAATGCCA 600 

40 CTAACCAGAT GGGAAGCAGT 7/TCTCGGATG AACTTTATGT GGACOTGACT TACATAGTTC 660 
AGCCAGACCC TCCTTTGGAG CTGGCTGTGG AAGTAAAACA GCCAGAAGAC AGAAA ACCCT 720 
ACCTGTGGAT TAAATGGTCT CCACCTACCC TGATTGACTT AAAAACTGGT TG GTTCAC GC 780 
TCCTGTATGA AATTCGATTA AAAOCCGAGA AAGCAGCTGA GTGGG AGATC CATTTT GCTO 840 
GGCAGCAAAC AGAGTTTAAG ATTCTCAGCC TACATCCAGG ACAGAAATAC CTTGTCCAGG 900 

45 TTCGCTGCAA ACCAG ACCAT GGATACTGGA GTGCATGGAG TCCAGCGACX TT CATTCAG A 960 
TAOCTAGTG A CTTCA(XATG AATGATACAA CCGTGTGGAT CTCTGTGGCT GTCCTTTCTG 1020 
CTGTCATCTG TTTGATTATT GTCTGGGCAG TGGCTITGAA GGGCTATAGC ATGGTOA CCT 1080 
GCATCTTTOC GCCAGTTCCT GGGCCAAAAA TAAAAGOATT TGATGCTCAT CTGTTGGAGA 1140 
AGGGCAAGTC TGAAGAACTA CTG AGTGCCT TGGGATGCCA AGACTTTCCT (XCAOTCTG 1200 

50 ACTATGAGGA CTTGCTGGTG GAGTATTTAG AAGTAGATGA TAGTGAGGAC CAGCATCTAA 1260 
TGTCAGTCCA TTCAAAAGAA CACCCAAGTC AAGGTATO AA ACCCACATAC CTGGATCCTG 1320 
ACACTGACTC AGGCCGGGGG AGCTGTGACA GCCCTTCCCT TTTGTCTGAA AAGTGTGAGO 1380 
AACCCCAGGC CAATOCCICC ACATTCTATG ATCCTGAGGT CATTGAGAAG CCAGAGAATC 1440 
CIX2AAACAAC CX^CACCTGG GACOOCCAGT GCATAAGCAT GGAAGGCAAA ATCOCCTA7T 1500 

55 TTCATGCTGG TGGATCCAAA TGTTCAACAT GGCCCITACC ACAGCCCAGC CAGCACAACO 1560 
CCAGATOCTC TTACCACAAT ATTACTQATG TGTGTGAGCT GGCTGTGGGC CCTGCAGGTG 1620 
CACCGGCCAC TCTGTTGAAT OAAGCAGGTA AAGATGCTTT AAAATCCTCT CAAACCATTA 1680 
AGTCTAGAGA AGAGGGAAAG GCAACCCAGC AGAGGGAGGT AG AAAGC TTC CATTCTGAGA 1740 
CTGACCAGGA TACOCCCTGG CTGCTGCCCC AGGAGAAAAC CCCCTTTCGC TCCGCTAAAC 1800 

60 CCTTOGATTA TGTGGAG ATT CACAAGGTCA ACAAAGATGG TGCATTATCA TTGCTACCAA I860 

AACAG AGAGA GAACAGCGGC AAGCCCAAGA AGCXXXK3GAC TCCTGAGAAC AATAAGGAGT 1920 
ATGCCAAGGT GTCCGGGGTC ATGGATAACA ACATOCTGGT GTTGGTGCCA GATCCACATG 1980 
CTAAAAACGT GGCTTGCTTT GAAG AATCAG CTAAAGAGGC CCCACCATCA CTTGAACAGA 2040 
ATCAAGCTGA GAAAGCCCTG GCCAACTTCA CTGCAACATC AAGCAAGTGC AGGCTCCAGC 2100 

65 TGGGTGGTTT GGATTACCTO GATCXXX5CAT GTTTTACACA CTCCTTTCAC 22ATAGCTTG 2160 
ACTAATGGAA TG ATTGGTTA AAATGTGATT 111CMCAGG TAACACTACA GAGTACGTGA 2220 
AATOCTCAAG AATGTAGTCA GACTG ACACT ACTAAAGCTC CCAGCTCCTT TCATGCTCCA 2280 
TTTTTAACCA CTTGCXTTCTT TCTCCAGCAG CTGATTCCAG AACAAATCAT TATGTTTOCT 2340 
AACTGTCATT TGTAGATTTA CrTTTTGCTG TTAGTTATAA AACTATGTGT TCAATGAAAT 2400 

70 AAAAGCACAC TGCTTAGTAT TCTTG AGGGA CAATGCCAAT AGGTATATCC TCTGGAAAAG 2460 
GCTTTCATOA TTTGGCATGG GACAG ACGGA AATOAAATTG TCAAAATTGT TTACCATAGA 2520 
AAGATGACAA AAGAAAATTT TCCACATAGG AAAATGCCAT G AAAATTGCT TTTG AAAAAC 2580 
AACTGCATAA CCTTTACACT CCTCGTCCAT TTTATTAGG A TTACCCAAAT ATAA(XATTT 2640 
AAAGAAAGAA TGCATTCCAG AACAAATTGT TTACATAAGT TCCTATACCT TACTGACACA 2700 

75 TTGCTGATAT GCAAGTAAGA AAT 

c n ^Mr^imProtetnscqueace: 
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Protein Accession #: NP_000940.1 

1 U 21 31 41 51 

5 MKENVASATV PTLLLFLNTC LLNGQLPPGK PEIFKCRSPN KETFTCWWRP GTDGGLPTNY 60 

SLTYHREGET LMHECPDY1T GGPNSCHFGK QYT»fWRTYI MMVNATNQMG SSFSDELYVD 120 
VTYIVOPDPP LELAVEVKQP EDRKPYLWIK WSFPTLIDLK TGWFTLLYEI RLKPEKAAEW 180 
HHFAGQQTE FKDLSLHPGQ KYLVQVRCKP DHOYWSAWSP ATFIQIPSDF TMNDTTVW1S 240 
VAVLSAVICL ITVWA VAUCG YSMVTCIFPP VPGPKDCGFD AHLLEKGKSB HiSALGCQD 300 

10 FPPTSDYHDL LVEYLEVDDS EDQHLMSVHS KEHPSQGMKP TYLDPDTTDSG RGSCDSPSLL 360 
SEKCEEPQAN PSTFYDPEVI HKPENPETTH TWDPQOSME GKIPYFHAGG SKCSTWPLPQ 420 
PSQHNPRSSY HNTTD VCELA VGP AG APATL LNEAGKDALK SSQTKSREB GKATQQREVE 480 
SFHSBTIX^)T PWLLPQEKTP FGSAKPLDYV EIHKVNKDGA LSLLPKQREN SUKFKKJVIP 540 
ENNKEYAKVS G VMDNNILVL VPDPHAKNVA CFKESAKEAP PSLEQNQAEK ALANPTATSS 600 

1 5 KOOQLGGLD YLDPACFTHS FH 



20 
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SeqIDNOMlDNAs 

Nucleic Acid Accession #: none found, Eos cloned sequence 

Coding sequence: i-1572 (underlined sequences correspond to start and stopcodons) 



1 11 21 31 41 51 
I I 1 I I I 

ATG ACCCAAA ATAAATTAAA GCTTTGTTOC AAAGOCAATG TGTATACTGA AGTGOCTGAT 60 
GGAGGATGGG GCTGGGCGGT AGCTGTTTCA 1111 ICHC G TTG AAGTCT T CAOCTAOGGC 120 
25 ATCATCAAGA CATTTGGTGT CTTCTTTAAT GACTTAATGG ACAGTTTTAA TG AATCCAAT 180 
AGCAGGATCT CATGGATAAT CTCAATCTGT GTGTTTGTCT TAACATTTTC AGCTCCCCTC 240 
GCCACAGTCC TGAGCAATCG TTTCGGACAC CGTCTGGTAG TGATGTTGGG GGGGCTACTT 300 
GTCAGCACCG GGATGGTGGC CGCCTCCTTC TCACAAGAGG TTTCTCATAT GTACGTCGCC 360 
ATCGGCATCA TCTCTGGTCT GGGATACTGC TTtAGTTTTC TCCCAACTGT AACCATCCTA 420 
30 TCACAATATT TTGGCAAAAG AOGTTOCATA GTCACTGCAG TTGCTTOCAC AGGAGAATGT 480 
TTCGCTGTGT TTGCTTTCGC ACCAGCAATC ATGGCTCTGA AGGAGCGCAT TGGCTGGAGA 540 
TACAGCCTCC TCTTCGTGGG CCTACTACAG TTAAACATTG TCATCTTCGG AGCACTGCTC 600 
AGACCCATCT TTATCAGAGG ACCAGCGTCA CCGAAAATAG TCATCCAGGA AAATOGGAAA 660 
GAAGCGCAGT ATATGCTTGA AAATGAGAAA ACACGAACCT CAATAGACTC CATTGACTCA 720 
3 5 GGAGTAGAAC TAACTAOCTC ACCTAAAAAT GTGCCTACTC ACACTAACCT GGAACTGGAG 780 
CCGAAGGCCG ACATGCAGCA GGTCCTGGTG AAGACCAGCC (XAGGCCAAG CGAAAAGAAA 840 
GCOCOGCTAT TAGACTTCTC CATTTTGAAA GAGAAAAGTT TTATTTGTTA TGCATTATTT 900 
GGTCTCTTTG CAACACTGGG ATTCTTTGCA CCTTCCTTGT ACATCATTCC TCTGGGCATT 960 
AGTCTGGGCA TTGACCAGGA CCGCGCTGCT TTTTTATTAT CTACGATGGC CATTGCAGAA 1020 
40 GTTTTCGGAA GGATCGGAGC TG UTI 1 iiiiV CTCAACAGGG AG CCCAT T OG TAAGATTTAC 1080 
ATTGAGCTCA TCTGCGTCAT CTTATTGACT GTGTCTCTGT TTGCCTTT AC TTITGCTACT 1140 
G AATTCTGGG GTCTAATGTC ATGCAGCATA TTTTTTGGGT TTATOGTTGG AACAATAGG A 1200 
GGGACTCACA TTCCACTGCT TGCTGAGGAT GATGTCGTGG GCATTGAGAA GATGTCTTCT 1260 
GCAGCTGGGG TCTACATCTT CATTCAGAGC ATAGCAGGAC TGGCTGGACC GCCCCTTGCA 1320 
45 GGTTTGTTGG TGGACCAAAG TAAGATCTAC AGCAGGGCCT TCTACTCCTG CGCAGCTGGC 1380 
ATGGCCCTGG CTGCTGTGTG CCTCGCCCTO GTGAGACOGT GTAAGATGGG ACTGTGCCAG 1440 
CATCATCACT CAGGTGAAAC AAAGGTAGTQ AGCCATCGTG GGAAGACTTT ACAGGACATA 1500 
CCTGAAGACT TTCTGGAAAT GGATCTTGCA AAAAATGAGC ACAGAGTTCA CGTGCAAATG 1560 
GAGCCGGTAZSA 



Sm rDNO: 42 Protein sequence: 

Protein Accession ft none found, Eos cloned sequence 



1 II 21 31 41 51 

55 i i i i i | 

MTQNKLKIjCS KANVYTEVPD GGWGWAVAYS FFFVEVFTYG DKTFGVFFN DLMDSFNESN 60 
SRISWHSIC VFVLTFSAPL ATVLSNRFGH RLWMLGGLL VSTGMVAASF SQHVSHMYVA 120 
IGTJSGIXjYC FSFIfTVTKL SQYFGKRRSI VTAVASTGBC FAVFAFAPAI MALKERIGWR 180 
YS1XFVGLLQ LNMFG ALL RPIFTRGPAS PKIVIQEKRK EAQYMLENEK TRTSIDSIDS 240 
60 GVELTTSPKN VPTHTNLELE PKADMQQ VLV KTSPRPSEKK APLLDFSILK EKSFICYALF 300 
GLFATLGFFA PSLYIIPLGI SLGIDQDRAA FLLSTMAIAB VFGRIGAGFV LNREPIRKIY 360 
IEUCVILLT VSLFAFTFAT EFWGLMSCSI FFGFMVGTIG GTHIPLLAED DWGIEKMSS 420 
AAGVYIF1QS IAGLAGPPLA GLLVDQSKIY SRAFYSCAAG MALAAVCLAL VRPCKMGLCQ 480 
HHHSGETKW SHRGKTLQDI PEDFLBMDLA KNEHRVHVQM EPV 



Seq ID NO: 43 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted ORF 

CofUngseqnence: 1-1749 (underlined sequences correspond to start and stop codons) 



70 I 11 21 31 41 51 
I I I I I I 

ATGCTGTCTG GCTTCTTGAT GAGTCCCAGT ACCCAGCACA GAGCACAGTA CACTCCCGGA 60 
GGAAAGAAAC TTCCGTGGG A GGCTTCCATC GGTGCGCACA CCTCCCGAGG GCGAGGCAGC 120 
GACCGGGAGA GGGAGAGCCG GCCGGAGGCT GCCGGGCTCC TGTGGGACCG CGCTGCAGCC 180 
75 GGGGAGGCGG AGAAGGGGAA CCGGGGCGAG CCGCCCGCCT GGATCCGCGC CCAGCAGCAG 240 
CCGCGGCCGC OGCCAGCTGG GCAGGCTOCC GGG ACTGCGG CTGGGGGCGC GCAGGACCCT 300 
CGCCTGCGTC CTGGACGTTC CCGGGGGAGG GTCCGGTTGC CAGTGAAACC TCCAGAGGCT 360 
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TCCGGACGAC AGCCCCGGGG GCCTTCTGAC TGCATCCCGA G ATTTCCATC AGOG AOTOCA 420 
ACTCATAAGG CAGTOCCTAA GGGGACCGGG CCAOCGGCTG AGGACGGGGA TGGCTTAGGA 480 
GCTCCTGGAC CTAGGGCO0G GOGTOGTOGC CTOCTGGGOG TCGCGGCAGA GGGGAGTGGC 540 
CCGCGCGGAA AGCGCCGCGG GACAGTCAGT GACGAGGCCC GGGGGTOGCC GGGGOCAOGA 600 
CTTCTCGGAG ACCGTCCTGC GCTCTCTGGA GACGCGCTGT CCGCGCCCAG GGTGGTGCCA 660 
TGTGGGGCGC TCGCCGCTOG TCCGTCTCCT CATCCTGG AA CXJCCGCTTCG CTCCTGCAGC 720 
TGCTGCTGGC TGOGCTGCTG GCGGOGGGGG OGAGGGOOCA GCGGOGAGTA CTGCCACGGC 780 
TGGCTGGACG CGGAGGGCGT CTGGCGCATC GGCTIGCAGT GTOOCGAGCG CTTCGACGGC 840 
GGCGACGCCA CCATCTGCTG CGGCAGCTGC GCGTTGCGCT ACTGCTGCTC CAGCGCCGAG 900 
GGGOGOCTGG ACCAGGGGGG CTGOG ACAAT G ACCGCCAGC AGGGOGCTGG GGAGOCTGGC 960 
CGGGCGGACA AAGACGGGCC CCGACGGCTC GGCAGGGCTT CATGTCTTAO GGGTAOOCAA 1020 
GGAGACGGCG AGGGTGCGCC CCCACCCGTG AGGGCCTGGC AGCGGTGCTC CCCTGAAGGC 1080 
TCCCCGAAAG G AAGGCAGCT CCTCAGGGCT TTCCCGGGGC TGCTGCCCCG TGOCAGACGC 1140 
CGCGGATTCC CATCTTCTCC ACGCGGCGGC CCCTCTCCCC TGCAGCGGCC CGOCTTGCCC 1200 
ATCTACGTGC CGTTCCTCAT TGTTGGCTCC GTGTTTGTCG CCTTTATCAT CTTGGQGTCC 1260 
CTGGTGGCAG CCTGTTOCTO CAGATGTCTC CGGCCTAAGC AGGATCCCCA GCAGAGCCGA 1320 
GCCCCAGGGG GTAACCGCTT G ATGGAGACC ATCCOCATGA TOOCCAGTGC CAGCACCTOC 1380 
CGGGGGTCGT CCTCACGCCA GTCCAGCACA GCTGCCAGTT CCAGCTCCAG CGCCAACTCC 1440 
GGGGCCCGGG CGCCOCCAAC AAGGTCACAG ACCAACTGTT GCTTGCCGGA AGGGACCATG 1500 
AACAACGTGT ATGTCAACAT GCCCACGAAT TTCTCTGTGC TGAACTGTCA GCAGGCCACC 1560 
CAGATTGTGC CACATCAAGG GCAGTATCTG CATCCCCCAT ACGTGGGGTA CACGGTGCAG 1620 
CACGACTCTG TGCCCATGAC AGCTGTGCCA CCTTTCATGG ACGGCCTGCA GCCTGGCTAC 1680 
AGGCAGATTC AGTCOCCCTT CCCTCACACG AACAGTGAAC AGAAGATGTA CCCAGCGGTO 1740 
ACTGTATAAC CGAGAGTCAC TGGTGGGTTC CTTTACTGAA GGGAGACGAA GGCAGGGGTG 1800 
GATTCTOGAG GTGGAAGTCC GCACATGTCG GTGGTATTTA TGGGAOGATT CCTTTGGATG 1860 
GCTTCATTTG CCCCCAGACT GTATGAAAAC ATCTCCGAAT TAGCATTTCT GGATATGTTT 1920 
CATCCAGGGT ATCATTQATT TATGATGGAA AACCGGCCTC AGCTGGAGAT GACTGTGATG 1980 
TTGCT GATGG GTGTATAACA AATGCTTQAG TOCO AAGTGC CCTTOAGATA TGGTTGACGA 2040 
AAGAATTTTA TAAACTGATA AATTAAGGAT TTTTATTATG TTGTTATTAT TATTTCTTTT 2100 
TTG 1 101 1GA CTGCACA QOA TCAAAATGCC TGTTATCTCC CTTTTACTGG GACTTTTTTT 2160 
TTTTTTTTTr TTTTTTTTAA TCAGACAGGG TCTTGCTCTG TTGCOCAGGC TGG AGTGCAG 2220 
TGGTGCGATC TCGGCTCACT GCAACTTCAG CCTCCTGGAT TCAGGCAACA CTCCTGCCTC 2280 
AGCCTCCCAC GTGGCTGGGA TTACAGGTGC CTGCCCCCAT GGCTAATTTT TTGTATTTTT 2340 
TGTAGAGATG GGGTTTCACC ATGTTGGCTG GGCTGGTCTC ACTCTCCTGA CCTCAAGCAA 2400 
TCTGCCTGTC TCAGCCTCCC AAAOTGCFGG G ATTACAGGC GTGAGCCACC GCCCCCAGCC 2460 
TGAGCClill II 1 1111 CTA ATGCATCCAA GGTTAAGGGG AAGACGCAAA TAACAGGACT 2520 
ATTCTAAAAG GAAACCTGTT TGAACTCTG7 GAGATCAGTC ATCAGTCTCA GTATTCCACA 2580 
GGCACACCTT AATTTCATTG TAAAAAGATA TATATATTTT GTCTATTTTT GTGCTTTTGG 2640 
GGGCCTATTT TGTGCTTTTT TACCTTATGT AG AGATCTTA TTACAAAGTG ATTTTCTACA 2700 
TTAAAAAGAG ACTGAAATAA ATTGTATAGT TACTTAACTA ATGAAGACAT TTCAGAACTC 2760 
TGGGATGATT TTAATCTTOA AGTAGTAGGT GGTATAGTCA TAAAACCATT CATCCCCTTC 2820 
TTGATTGTAT CTTAATTTTC TGGCTTTAAG GTGACATCTO AGAGGTAATG CATTCTTTTT 2880 . 
TATATTQAAA TCATAAACTA TCACCCGCTG CI ICICI U AO TTACTTTTAA 11 11GCLT1U 2940 
TGGTTATGGT TTGGCGTTTC CTTCTGTrrG GTTTTCAGAG CCCCATGTCT ATATAGTCCT 3000 
GAGTGCAAGT AATTACTATA CTTGTAAATG AAOATCAGTA TTTCTGCCTA GATCTGATAA 3060 
AAAAATTTTC TTGTCTTAGT TATAAAAATT CAAAG AAATG TGTTACAAAG ATACTTAGTA 3120 
TAGCTCCTCA GCCATAACCT GAGACTTGGG ATGAAATTTA AACCAG ATAC GATTTACTTT 3180 
GCAGATCATA AGGCTTTTTA TACTCTTGTT ATCAAAATGG CTTATTTTTC AGGCACTAAG 3240 
GATTGTTAAG AGAAAAGCTT TTCAACGAAG GATTOCCTTT CTTCTCCCAC ACTGTTCTTG 3300 
ATTTCCTCTC TCTTTCAGGC CTCAACAGGC ACTGTATTCA TTGCCAATGT TOCAAATTAT 3360 
CAAATTCAAG TGAATTTATT 1G1G1UUC1 TT ACTTATAT AAAAAAAGAT AACTTTAAGG 3420 
ATGTGCAAGT ACATTTCCAA CTGCTAGCAC AACCAGTATT TTGTAATTAA ACAAATCGCT 3480 
GTATGGTATG GTCTTCTACA CATTTATGTC TATAGATATC TATCOATCAT CTTTCTATTC 3540 
TGTTTCATGA CTOAATAATG TAAAACCAGT GTTCGCAATT GGTATCATCA ATGATACTCA 3600 
TTTTTTAATA AOCAAAGGCA GGGG AAAATC ATTTTACTTA TTAATAAATA TTTTATGATG 3660 
TGAAAAAAAA AAAAAAAAAA AAAAAAAAAA 

Seq ID NO: 44 Protein seqcence: 

Protein Accesstoo ft FGENESH predicted 

1 11 21 31 41 51 
) I I I I 1 

MLSGFLMSPS TQHRAQYTPG GKKLPWEASI GAHTSRGRGS DRERESRPHA AGLLWDRAAA 60 
GEAEKGNRGE PPAWIRAQQQ PRPPPAGQAP GTAAGG AQDP RLRPGRSRGR VRLPVKPPHA 120 
SGRQPRGPSD C1PRFPSASA TISCA VPKGTG PPAEDGDGLG APGPRARRRR LLGVAAEGSG 180 
PRGKRRGTVS DEARGSPGPR LLGDRPALSG DAl^APRWP CGA1AARPSP HPGTPLRSCS 240 
CCWLRCWRRG RGPSGEYCHG WLDAQGVWRI GFOCPERFDG GDATICOGSC ALRYOCSSAE 300 
ARLDQGOCDN DRQQGAGEPG RADiaXJPRRL GRASCLRGTQ GDGEGAPPPV RAWQRCSPEG 360 
SPKGRQLLRA FPGLLPRARR RGFPSSPRGG PSPLQRPALP IYVPFLIVGS VFVAFULGS 420 
LVAACCCWX RPKQDPQQSR APGGNRLMET IPMIPSASTS RGSSSRQSCT AASSSSSANS 480 
GARAFPTRSQ TNCCLPBGTM NNVYVNMPTN FSVLNCQQAT QfVPHQGQYL HFPYVGYTVQ 540 
HD5VPMTA VP PFMDGLQPGY ROJQSPFPHT NSEQKMYPAV TV 

Seq ID NO: 45 DNA sequence 

Nncteb Acid Accession ft NMJD02285 

Coding seque n ce: 55-3738 (underiined sequences correspond to start and stop codons) 
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1 II 21 31 41 51 
I I I I I I 

GGCCGAGCCT CGOCGGCGGC GGTAGCGGCG GCGGCG ACGC TGACACCTCC CACCATGOAC 60 
AGCTTCGACT TAGCCCTGCT (XAGGAATGG GACCTCGAGT CACTGTGTGT CTATGAACCA 120 
GATAGAAATG CATTACGGAG GAAAGAACGA GAAAG AAGAA ATCAAGAAAC TCAACAGGAT 180 
GATOGCACXTT TTAATTCTAG TTACTCTCTC TTCAGTGAGC CCTACAAGAC TAACAAGGGG 240 
GATGAACTCT CCAACCGGAT CCAGAACACT TTAGGCAATT ATGATGAAAT GAAAGACTTT 300 
TTAACTGATA GAACCAATCA G AOTCATCTC GTTGG AGTTC CCAAACCTGG GGTTCCTCAG 360 
ACTCCTGTGA ACAAGATCGA TGAACATTTT GTTGCAGATT CAAGAGCCCA GAACCAGCCC 420 
TCGTCTATCT GTAGCACTAC AACTTCCACA CCAGCAGCTG TCCCCGTGCA GCAGAGTAAA 480 
AGAGGCACTA TGGGCTGGCA G AAGGCTGGG CACCCACCCT CTGACGGCCA ACAGAGAGCA 540 
ACACAACAGG GCTCTCTCAG GACC J ltiC l 1 GOAGATGGTG TTOGGAGACA GCAGCCTCGG 600 
GCCAAACAAG TGTGCAATGT GGAGGTGGGC CTTCAGACCC AGGAGAGGCC ACCTGCCATG 660 
GOGGOCAAGC ACAGCAGCAG COG ACACTGT GTTCAGAACT TTCCTCCATC CCTAQCTTCA 720 
AAACCCAGCC TGGTCCAGCA GAAACCGACC GCGTATGTGA GGCCAATGGA CGGCCAAGAT 780 
CAGGCCCCTG ATGAGTCTCC TAAGCTGAAG TCGTCTTOGG AAACCAGCGT GCACTGCACA 840 
TCATACAGGG GAGTCCCTGC CAGCAAGCCG GAGCCTGCCA G AGCCAAGGC CAAGCTCTCC 900 
AAGTTCAGCA TCOOCAAGCA GGGGGAGG AG AGTAGATCTG GAG AAAOCAA CAGCTGTGTT 960 
G AAG AAATAA TCCGGG AG AT G ACCTGGCTT CCAOCACTIT CTGCTATTCA AGCACCTGGC 1020 
AAAGTGG AAC CAACCAAATT TCCATTTCCA AATAAGGACT CTCAGCTTGT ATCCTCTGG A 1080 
CACAATAATC CAAAOAAAGG TGATGCAGAG CCAGAGAGTC CAGACAATGG CACATCGAAT 1 140 
ACATCAATGC TGGAAGATGA CCTTAAGCTA AGCAGTGATG AAGAGGAGAA TGAACAGCAG 1200 
GCAGCTCAGA GAACGGCTCT CCGCGCTCTC TCTG ACAGCG COGTGGTOCA GCAGCCCAAC 1260 
TGCAGAACCT CGGTGCCTTC CAGCAAGGGC AGCAGCAGCA GCAGCAGCAG CGGCACGAGC 1320 
AGCTCCTCCA GCGACTCAGA G AGCAGCTCC GGATCTGACT CGGAGACCGA GAGCAGCTCC 1380 
AGCGAGAGTG AGGGCAGCAA GCCCCOCCAC TTCTCCAGCC OCGAGGCTGA ACCGGCATCC 1440 
TCTAACAAGT GGCAGCTGGA TAAATGGCTA AACAAAGTTA ATCCCCACAA GCCTCCTATT 1500 
CTGATCCAAA ATGAAAGCCA CGGGTCAGAG AGCAATCAGT ACTACAACCC GGTGAAAGAG 1560 
GACGTCCAGG ACTGTGGGAA AGTCCCCGAC GnTGCCAGC CCAGCCTGAG AGAG AAGGAG 1620 
ATCAAGAGCA CTTGCAAGG A GG AGCAAAGG CCAAGGACAG OCAACAAGGC CCCTGGGAGT 1680 
AAAGGCGTGA AGCAGAAGTC CCCGCCCGCG GCCGTGGCCG TGGCGGTGAG CGCAGCCGCC 1740 
CXXKXACCCG CAGTGCCCTG TGCGCCCGCG GAGAACGOGC CCGCGOCTGC CCGGAGGTCC 1800 
GCGGGCAAGA AGCCCACCAG GCGCACCGAG AGGACCTCAG CCGGGGACGG CGCCAACTGC 1860 
CACCGGCCCG AGGAGCCCGC GGCCGCGGAC GCGCTGGGGA CGAGCGTGGT GGTCCCCCCG 1920 
GAGCCCACCA AAACCAGGCC CTGTGGCAAC AACAGAGCGA GCCACCGCAA GGAGCTGCGC 1980 
TCCTCCGTGA CCTGCGAGAA GCGCCGCACG CGGGGGCTAA GCAGGATCGT CCCCAAATCC 2040 
AAGGAGTTCA TTGAGACAG A GTCGTCATCT TCATOCTDCT CCTCGG ACTC OGACCTGGAG 2100 
TOOGAGCAGG AGGAOTACCC TCTGTOCAAA GCACAGAOCG TGGCTGCCTC TGCCTCCTCC 2160 
GGGAATGATC AGAGGCTGAA GGAGGCCGCT GCCAACGGGG GCAGTGGTCC TAGGGCCCCT 2220 
GTAGGCTCCA TCAACGCCAG G ACCACCAGT GACATOOCCA AGGAGCTGGA GGAGCAGTTC 2280 
TACACACTGG TCCOCTTTGG COGGAACGAA CTTCTCTOOC CICTAAAGG A CAGTGATGAG 2340 
ATCAGGTCTC TCTGGGTCAA AATCGACCTG ACCCTCCTGT CCAGGATCCC AGAACACCTG 2400 
CCCCAGGAGC CAGGGGTATT OAGCGCCCCT GCCACCAAGG ACTCTGAG AG CGCACCGCCC 2460 
AGCCACACCT CGGACACACC TGCAGAAAAG GCTTTGCCAA AATCCAAGAG GAAACGCAAG 2520 
TGTOACAACG AAGACGACTA CAGGGAGATC AAGAAGTCCC AGGGAGAGAA AGACAGCTCT 2580 
TCAAGACTGG CCACCTCCAC CAGTAATACT TTGTCTGCAA ACCACTGCAA CATGAACATC 2640 
AACAGTGTGG CAATACCAAT AAATAAAAAT G AAAAAATGC TTOGGTCGCC CATCTCAOCC 2700 
CTCTCTGATG CATCTAAACA CAAATACACC AGCGAGGACT TAACTTCTTC CAGCCG ACCT 2760 
AATGGCAACA GTTTGTTTAC TTCAGCCTCT TCCAGCAAAA AGCCTAAGGC CGACAGCCAG 2820 
CTGCAGCCTC ACGGCGGAGA CCTCACGAAA GCAGCTCACA ACAATTCTGA AAACATTOCC 2880 
CTCCACAAGT CACGGOCGCA GACGAAGCCG TGGTCTCCAG GCICCAACGG CCACAGGGAC 2940 
TGCAAGAGGC AGAAACTTGT CTTCGATGAT ATGCCTCGCA GTGCCGATTA TTTTATGCAA 3000 
GAAGCTAAAC GAATGAAGCA TAAAGCAGAT GCAATGGTGG AAAAGnTGG AAAGGCTTTG 3060 
AACTATGCTG AAGCAGCATT GTCGTTTATC GAGTGTGGAA ATGCAATGGA ACAAGGOCOC 3120 
ATGGAATCCA AATCTOCTTA TTACCTGATG TATTCAGAAA CAGTAGAGCT CATCAGGTAT 3180 
GCTATGAG AC TAAAAAOCCA CTCAGGCOCC AATGCCACAC CAG AAGACAA ACAACTGGCT 3240 
GCATTATGTT ACCGATGOCT GGCOCTOCTG TACTGGCGGA TGTTTCGACT CAAAAGGGAC 3300 
CACGCTGTAA AGTATTCAAA AGCACTAATC GACTATTTCA AGAACTCATC TAAAGCCGCC 3360 
CAAGCCCCATCTCCGTGGGG GGCCAGTGGA AAGAGCACTG GAACCCCATC CCCCATTTCT 3420 
CCCAACOCCT TTCCCGGCAG CTOCGTGGGG TCTCAGGGCA GCCTCTCCAA CGCCAGCGCC 3480 
CTGTCCCCGT CGACCATCGT CAGCATCCCA CAGCGCATCC ACCAGATGGC GGCCAACCAC 3540 
GTCAGCATCA CCAACAGCAT CCTGCACAGC TACG ACTACT GGGAGATGGC CG ACAAOCTG 3600 
GOCAAGGAAA ACCG AGAATT CTTCAACG AC CTGGATCTGC TCATGGGGCC GGTCACCCTG 3660 
CACAGCAGCA TGGAGCACCT GGTCCAGTAC TCCCAACAGG GCCTGCACTG GCTGCGGAAC 3720 
AGCGCCCACC TGTCAJAGGG ACCTCACCCT GGGGCCAGAG TGGGCTCTGG TCTCCACAGA 3780 
TGGCTCAACG TTTTTGGACA CTGTGCTACT GAAACTTXCA GCCACAGCAT TTATAGACTG 3840 
CGGTGAACATTTCCTCA 

SeqlDNCh 46 Protein sequencer 
Protein Accesskm ff: NP 002276 

1 II 21 31 41 51 
I I I I I I 

MDSFDLALLQ EWDLESIjCVY EPDRNALRRK ERERRNQETQ QDDGTFNSSY SLFSEPYKTN 60 
KGDELSNRIQ NTLGNYDEMK DFLTDRTNQS HLVGVPKPGV PQTPVNKIDE HFVADSRAQN 120 
QPSSICSTTT STPAAVPVQQ SKROTMGWQK AGHPPSDGQQ RATQQGSLRT LLGDGVGRQQ 180 
PRAKQVCKVE VGLQTQERPP AMAAKHSSSO HCVQNFPPSL ASKPSLVQQK PTAYVRPMDG 240 
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QDQAPDESPK LKSSSETSVH CTSYRGVPAS KPEPARAKAK LSKFSIPKQG EESRSGETNS 300 
CVEEDREMT WLPPLSA1QA PGKVEPTKFP FPNKDSQLVS SGHNNPKXGD AEPESPDNGT 360 
SNTSMLEDDL KLSSDEEENE QQAAQRTALR ALSDSA WQQ PNCRTSVPSS KGSSSSSSSG 420 
TSSSSSDSES SSGSDSBTES SSSBSEGSKP PHFSSPEAEP ASSNKWQLDK WLNKVNPHKP 480 
PUJQNESHG SESNQYYNPV KEDVQDCGKV PDVCQPSLRB KEKSTCKEE QRPRTANKAP 540 
GSKGVKQK5P PAAVAVAVSA AAPPPAVPCA PAENAPAPAR RSAGKKPTRR TERTSAGDGA 600 
NCHRPEEPAA ADALGTSVW PPEPTKTRPC GNKRASHRKB LRSSVTCEKR RTRGLSRIVP 660 
KSKEFDSTES SSSSSSSDSD LESEQEEYPL SKAQTVAASA SSGNDQRLKB AAANGGSGPR 720 
APVGSINART TSDIAKELEB QFYTLVPFGR NELLSPLKDS DHRSLWVKI DLHXSRIPB 780 
HLPQEPGVLS APATKDSESA PPSHTSDTPA EKALPKSKRK RKCDNEEDYR EIKXSQGHKD 840 
SSSRLATSTS NTLSANHCNM NINSVAIP1N KNEKMLRSPI SPLSDASKHK YTSEDLTSSS 900 
RPNGNSLFTS ASSSKKPKAD SQLQPBGGDL TKAAHNNSEN IPLHKSRPOT KPWSPGSNGH 960 
RDCKRQKLVF DDMPRSADYF MQEAKRMKHK ADAMVEKFGK ALNYAEAALS FIECGNAMEQ 1020 
GPMESXSFYY LMYSETVELI RYAMRLK.THS GPNATPEDKQ LAALCYRCLA LLYWRMFRLK 1080 
RDHAVKYSKA UDYFKNSSK AAQAPSPWGA SGKSTGTPSP ISPNPFPGSS VGSQGSLSMA 1140 
SALSPSmVS IPQRIHQMAA NHVSITNSIL HSYDYWEMAD NLAKENRBFF NDLDLLMGPV 1200 
TLHSSMEHLV QYSQQGLHWL RNSAHLS 

Seq ID NO: 47 DN A sequence 

Nuddc Acid Accession #: NM.033151 

Coding sequence: 35 1 -4499(undertined sequeoces correspond to start and stop codons) 



1 11 21 31 41 51 

ACTGGG ATAA AGCAAGAAGA CTGATTTTAT GAGCAGGGGT TTGATACATC AAAGG AGATT 60 
GCCCAGGATC AAGGGTGCGG TGTTGGGGOT GGGTTGGGGA GGGTGGTTAG AGAAGGTTTC 120 
ACTAAGTGAT TTGGGCCTGA GGCCTGAGAA GATGTTTAAA AAGAGGGATC AAGCACAGGC 180 
TAAGGAG AGG AAAGAGCAGG CACCCAAACC TCTGCATGGC CCCAATATGC TCCCTGCAGG 240 
GTAGTGOCCC ClldTCTGGC TGCTCAAGGC GAGATCTAAG CTIUI IClAA CTCCTGCTQT 300 
CTTTTCATAT TCTCTGATTC TGGGAAACX3A AGAATTCGCA GGAACTGAAA ATGA CTAGGA 360 
AGAGGACATA CTGGGTGCCC AACTCTTCTG GTGGCCTCGT GAATCGTGGC ATCGACATAG 420 
GCGATGACAT GGTTTCAGGA CTTATTTATA AAACCTATAC TCTCCAAGAT GGOCCCTGGA 480 
GTCAGCAAG A G AGAAATCCT GAGGCTCCAG GGAGGGCAGC TGTCCCACCG TGGGGGAAGT 540 
ATGATGCTGC CTTGAGAACC ATGATTCCCT TOCGTCCCAA GCCGAGGTTT CCTGCCCCCC 600 
AQCCCCTOGA CAATGCTGGC C1GTTCTCCT ACCTCACCGT GTCATGGCTC ACCCCGCTCA 660 
TGATCCAAAG CTTACGGAGT CGCTTAGATG AGAACACCAT OXTCCACTG TCAQTCCATG 720 
ATGCCTCAGA CAAAAATGTC CAAAGGCTTC ACCGCCTTTG GGAAGAAGAA GTCTCAAGGC 780 
GAGGGATTGA AAAAGCTTCA GTGCTTCTGG TGATGCTGAG GTTCCAGAGA ACAAGGTTGA 840 
TTTTCGATGC ACTTCTGCGC A1C 1 GCTICT GCATTGCCAG TGTACTCGGG (XAATATTGA 900 
TTATACCAAA GATCCTGGAA TATTCAGAAG AGCAGTTGGO GAATGTTGTC CATGGAGTGG 960 
GAC I C m cif TGCXXTTTTTCTCTCCGAAT GTGTGAAGTC TCTGAGTITC TCCTCCAGTT 1020 
GGATCATCAA CCAACGCACA GCCATCAGGT TXXX3AGCAGC TGTTTCCTCC TTTGCCTTTG 1080 
AGAAGCTCAT CCAATTTAAG TCTGTAATAC ACATCACXTTC AGGAGAGGCC ATCAGCITCT 1140 
TCACCGGTGA TGTAAACTAC CTGTTTGAAG GGGTGTGCTA TGGACCCCTA GTACTGATCA 1200 
CCTGCGCATC GCTGGTCATC TOCAGCATTT CTTCCTACTT CATTATTGGA TACACTGCAT 1260 
TTATTGCCAT CTTATGCTAT CTCCTGGTTT TCCCACTGGC GGTATTCATG ACAAG AATGG 1320 
CTGTG AAGGC TCAGCATCAC ACATCTGAGG TCAGCGACCA GCGCATCCGT GTGAOCAGTG 1380 
AAGTTCTCAC TTGCATTAAG CTGATTAAAA TGTACACATG GGAGAAACCA TTTGCAAAAA 1440 
TCATTGAAGA CCTAAGAAGG AAGGAAAGGA AACTATTGGA GAAGTGCGGG CTTGTCCAGA 1500 
GCCTGACAAG TATAACCTTG TTCATCATCC CCACAGTOOC CACAOOGQTC TGGOTTCTCA 1560 
TCCACACATC CTTAAAGCTG AAACTCACAG CGTCAATGGC CTTCAGCATG CTGGCCTCCT 1620 
TGAATCTCCT TCGOCTOTCA GT U 1 1 C I 1 1& TGCCTATTGC AGTCAAAGGT CTCACGAATT 1680 
CCAAGTCTGC AGTGATGAGG TTCAAGAAGT TTTTCCTCCA GGAGAGCCCT GTTTTCTATG 1740 
TCCAGACATT ACAAGACCCC AGCAAAGCTC TGGTCTTTGA GGAGGCCACC TTGTCATGGC 1800 
AACAG ACCTG TCCCGGGATC GTCAATGGGG CACTGGAGCT GGAGAGGAAC GGGCATGCTT I860 
CTOAGGGGAT GACCAGOCCT AGAGATGCCC TCGGGCCAGA GOAAGAAGGG AACAGCCTGO 1920 
GOCCAGAGTT GCACAAGATC AACCTGGTGG TGTCCAAGGG GATGATOTTA OOGQT CTOCO 1980 
GCAACACGGG GAGTGGTAAG AGCAGCCTGT TGTCAGCCAT CCTGGAGGAG ATGCACTTGC 2040 
TCGAGGGCTC GGTGGGGGTG CAGGGAAGCC TGGCCTATGT CCCCCAGCAG GCCTGGATCG 2100 
TCAGCGGGAA CATCAGGGAG AACATCCTCA TGGGAGGCGC ATATGACA AQ GCC COATACC 2160 
TCCAGGTGCT CCACTGCTGC TCCCTGAATC GGGACCTGGA ACTTCTGCCC TTTGGAGACA 2220 
TGACAOAGAT TGGAGAGCGG GGCCTCAACCTCTCrGGGGG GCAGAAACA G AGGATC ACCC 2280 
TGGCCCGCGC CGTCTATTOC GACCGTCAGA TCTACCTGCT GGACGACCCC CTGTCTGCTG 2340 
TGGACGCCCA CGTGGGGAAG CACATTTTIO AGGAGTGCAT TAAGAAGACA CTCAGGGGGA 2400 
AGACGGTCGT CCTGGTGACC CACCAGCTGC AGTACTTAGA ATnTGTGGC CAGATCATTT 2460 
TGTTGGAAAA TGGG AAAATC TGTOAAAATG GAACICACAO TGAQTTAATG C AGAAA AAGG 2520 
GGAAATATGC CCAACTTATC CAGAAGATGC ACAAGGAAGC CA CTTCGG AC ATOTTGCAGG 2580 
ACACAGCAAA GATAGCAGAG AAGCCAAAGG TAGAAAGTCA GGCTCTGGCC ACCTCCCTGG 2640 
AAGAGTCTCT CAACGGAAAT GCTGTGCXX5G AGCATCAGCT CACACAGGAG GAGGAGATGG 2700 
AAGAAGGCTC CTTGAGTTGG AGGGTCTACC ACCACTACAT CCAGGCAOCT GGAO OTTACA 2760 
TGGTCTCTTG CATAATTTTC TTCTTCGTGG TGCTGATCGT CTTCTTAACG ATCTTCAGCT 2820 
TCTGGTGGCT GAGCTACTGG TTGGAGCAGG GCTCGGGGAC CAATAOCAGC CGAGAGAGCA 2880 
ATGGAACCAT GGCAGACCTG GGCAACATTG CAGACAATCC TCAACTGTOC TTCTACXAGC 2940 
TGGTGTACGG GCTCAACGCC CTGCTCCTCA TCTGTGTGGG GGTCTGCTCC TCAGGGATTT 3 000 
TCACCAAAGT CACGAGGAAG GCATCCACGG CCCTGCACAA CAAGCTCTTC AACAAGGTTT 3060 
TCCGCTGCCC CATG AGTTTC TTTGACACCA TCCCAATAGG CXXKjCTTTTG AACTGCTTCG 3120 
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CAGGGGACTT OGAACAGCTG GACCAGCTCT TGCCCATCTT TTCAGAOCAG TTCCTGGTCC 3180 
TGTDCTTAAT GGTG ATCGCC GTCCTGTTG A TTGTCAGTGT GCTGTCTCCA TATATCCTGT 3240 
TAATGGGAGC CATAATCATG GTTATTTGCT TCATTTATTA TATGATGTTC AAGAAGGCCA 3300 
TCGGTGTGTT CAAGAGACTG GAGAACTATA GCCGGTCTCC TTTATTCTCC CACATCCICA 3350 
- 5 ATTCTCTGCA AGGCCTGAGC TCCATCCATG TCTATGG AAA AACTGAAGAC TTCATCAGCC 3420 
AGTTTAAGAG GCTGACTGAT GCGCAGAATA ACTACCTGCT GTTGTTTCTA TCTICCACAC 3480 
GATGGATGGC ATTGAGGCTG GAGATCATGA CCAACCTTGT GACCTTGGCT GTTGCCCTGT 3540 
TCGTGGCTTT TGGCATrTCC TCCACCCCCT ACTCCTTTAA AGTCATGGCr GTCAACATCG 3600 
TGCTGCAGCT GGCGTCCAGC TTCCAGGCCA CTGCCCGGAT TGGCTTGGAG ACAGAGGCAC 3660 

1 0 AGTTCAOGGC TGTAGAGAGG ATACTGCAGT ACA7GAAG AT GTGTGTCTCG GAAGCTCCTT 3720 
TACACATGGA AGGCACAAGT TOTCOCCAGG GGTCGCCACA GCATGGGGAA ATCATATTTC 3780 
AGGATTATCA CATOAAATAC AGAQACAACA CACCCAOCGT GCTTCACGGC ATCAACCTGA 3840 
CCATCCGCGG CCACGAAGTG GTGGGCATCG TGGGAAGGAC GGGCTCTGGG AAGTCCTCCT 3900 
TGGGCATGGC TCTCTTCCGC CTGGTGG AGC CCATGGCAGG CCGGATTCTC ATTGAOGGCG 3960 

1 5 TGGACATTTG CAGCATCGGC CTGGAGGACT TGCGGTCCAA GCTCTCAGTG ATOCCTCAAO 4020 
ATCCAGTGCT GCTCTCAGGA ACCATCAG AT TCAACCTAGA TOCCTTTGAC CGTCACACTG 4080 
ACCAGCAGAT CTGGGATGCC TTGGAGAGGA CATTCCTGAC CAAGGCCATC TCAAAGTTOC 4140 
CCAAAAAGCT OCATACAGAT GTGGTGG AAA ACGGTGGAAA CTTCTCTGTG GGGGAGAGGC 4200 
AGCTGCTCTG CATTGCCAGG GCTGTGCTTC GCAACTOCAA GATCATCCTT ATCGATGAAG 4260 

20 CCACAGCCTC CATTGACATG GAG ACAGACA CCCTGATCCA GCGCACAATC CGTGAAGCCT 4320 
TCCAGGGCTG CACCGTGCTC GTCATTGCCC ACCGTGTCAC CACTGTGCTG AACTGTG ACC 4380 
ACATCCTGGT TATGGGCAAT GGGAAGGTGG TAGAATTTGA TCGGCCGGAG GTACTGCGGA 4440 
AGAAGCCTGG GTCATTGTTC GCAGCCCTCA TGGCCACAGC CACTTCTTCA CTGAG ATAAG 4500 
- GAGATGTGGA GACTTCATGG AGGCTGGCAG CTGAGCTCAG AGGTTCACAC AGGTGCAGCT 4560 

25 TCGAGGCCCA CAGTCTGCGA CCTTCTTGTT TGGAGATGAG AACTTCTCCT GGAAGCAGGG 4620 
GTAAATGTAG GGGGGGTGGG GATTGCTGGA TGGAAACCCT GGAATAGGCT ACTTGATGGC 4680 
TCTCAAGACC TTAGAACCCC AGAACCATCT AAGACATGGG ATTCAGTG AT CATGTGGTTC 4740 
TCCTTTTAAC TT ACATGCTG AATAATTTTA TAATAAGGTA AAAGCTTATA GTTTTCTGAT 4800 
CTGTGTTAGA AGTGTTGCAA ATGCTGTACT GACTTTGTAA AATATAAAAC TAAGGAAAAC 4860 

30 tc 

Sea ro HO: 48 Protein 9eoneDce; 
Protein Accession fc NPJ49163.2 

35 MTRKRTYWVFNSSGGLVNRGIDIGDDMVSGUY^ 

NflPFRPKPWTAPQPLDNAGU'SYLT^^ 

VSRRGIEKASVIXVMUU^RTRIJFDAIXGICFCIASV^ 

LSECVKSl^SSSWIINt^AIEFRAAVSS^^ 

VUTCASLVICSISSYFnGYTAFlAIU^riXVlTl^ 
40 Lnai Y T W i iKP F A CTEDIJUtfCERia^LEKCGLV 

IJ^SLNUJU^VFFVPIAVKGLTNSKSAVMRFKKIT^^ 

VKGALELERNGHASEGMTRPRDALGPEEEGKSI^^ 

MHli^SVGVQGSLAYVPQQAWrV5GNIRENIU4 

GUOSGGQKQRISLAIttVYSDRQIYLLD^ 
45 QIIIXENGKICEhKnHSELMQKKGKYAQ 

AVPEH(^TQEEEMEEGSLSWRVYHHYIQAAGGYMVSCIIFFFVVliVF^ 

RESNOTMADLGNIADKPQLSFYQ^ 

FDTmGIUXNCFAGDiJBQIJ>QU^ 
^ KXAIGVFK2UJ2NYSRSPLFSHIl>fSIjQ^ 
50 EIMTNLVTLAVAU^AFGISSTPYSHCVMAVNrVlX^LASSFQATA 

EAPliiMEGTSCFQGWFQHGEnFQDYHMKY^ 

LVEPMAGWUDGVDKSGLEDIilSKLSVIPQDP^ 

SKFPKKUnDVVENGGhOWGERQlXCIARAVU^ 
^ VIAffiOTTVUTCDinLVMaNa^ 

Seq ID NO: 49 DNA sequence 

NucIdcAcM Accession ft NM 033419 

Oxfing sequence: 1 8-980 (andertmcd sequences conespond to start and stop codons) 

60 1 11 21 31 41 51 
1 1 I 1 I I 

CflAfiCCAGGG AGAAAGGATG GCOGGCCTGG CGCCGCGGTT OCTCCTGCTA GCTGCGGCAG 60 
CGGCGCTGGC GAGCGOCTCC CAGGGCGACC GTGAGCCGGT GTACCGCGAC TGCGTACTGC 120 
AGTGCOAAG A GCAGAACTGC TCTGGGGGCG CTCTOAATCA CTTCCGCTCC CGCCAGCCAA 180 

65 TCTACATGAG TCTAGCAGGC TG GACCTGTC GGGACGACTG TAAGTATGAG TGTATGTGGG 240 
TCACCGTTGG GCTCTACCTC CAGGAAGGTC ACAAAGTGCC TCAGTTCCAT GGCAAGTGGC 300 
CCTTCTCCCO GTTCCTGTTC TTTCAAG AGC COGCATCGGC CCnGGCCTCG TTTCTCAATG 360 
GCCTGGCCAG CCTGGTGATG CTCTGCCGCT ACCGCACCTT CXTTGCCAGCC TCCTCCCCCA 420 
TGTACCACAC CTGTGTGGCC TTCGCCTGGG TGTCCCTCAATGCATGGTTC TGGTCCACAG 480 

70 TTTTCCACAC CAGGGACACT GACCTCACAG AGAAAATGGA CTACTTCTGT GCCTCCACTG 540 
TCATCCTACA CTCAATCTAC CTGTGCTGCG TCAGGACCGT GGGGCTGCAG CACCCAGCTG 600 
TGGTCAGTGC CTTCCGGGCT CTCCTGCTGC TCATGCTGAC CGTGCACGTC TCCTACCTGA 660 
GCCTCATCCG CTTCG ACTAT GGCTACAACC TGGTGGCCAA CGTGGCTATT GGCCTGGTCA 720 
ACGTGGTGTG GTGGCTGGCC TGGTGCCTGT GGAACCAGCO GCGGCTGCCT CACGTGCGCA 780 

75 AGTGCGTGGT GGTGGTCTTG CTGCTGCAGG GGCTGTCCCT GCTCGAGCTG CTTGACTTCC 840 
CACOGCTCTT CTGGGTCCTG G ATGCCCATG CCATCTGGCA CATCAGCACC ATCCCTGTCC 900 
ACGTCCTCTT TTTCAGCTTT CTGG AAGATG ACAGCCTGTA CCTGCTGAAG GAATCAGAGG 960 
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ACAAGTTCAA GCTGGACTGA AGACCTTGGA GCGACTCTGC CCCAOTGGOQ ATOCTGOOOC 1020 
CGCCCTGCTG GCCTCCCTTC TCCCCTCAAC CCTTGAGATG Al 1 1 1U1C1 1 TTCAACTTCT 1080 
TOAACTTGGA CATO AAOGAT GTGGGCCCAG AATCATGTGG CCAGCCCACC CCCTGTTGGC 1140 
CXTTCACCAGC CITGG AGTCT GTTCTAQCGA AGGCCTCCCA GCATCTOGGA CTCGAGAGTG 1200 
5 GGCAGCCCCT CTACCTCCTG G AGCTGAACT GGGGTGGAAC TOAGTGTGCT CTTAGCTCT A 1260 
CCGGGAGGAC AGCTGCCTGT TTCCTCCCCA TCAGCCTCCT CCCCACATCC CCAGCTGCCT 1320 
GGCTGGGTCC TGAAGCCCTC TGTCTACCTG GGAGAOCAGG OAOCACAGGC CTTAGGGATA 1380 
CAGGGGGTCC CC1 IOG1 1 A CCACCCCCCA CCCTOCrCCA GQACACCACT AOOTQQTGCT 1440 
GGATGCTTGT TCTTTGGCCA GCCAAGGTTC ACGGCGATTC TCCCCATGGG ATCTTG AGOG 1500 

10 ACCAAGCTGC TGGGATTGGG AAGGAGTTTC ACCCTGACCA TTGCCCTAGC CAGGTTCCCA 1560 
GG AGGCCTCA CCATACTCCC TTTCAGGCKX AGGGCTCXAG CAAGCOCAGG GCAAGGATCC 1620 
TGTGCTGCTG TCTGGTTCAG AGCCTGCCAC CGTGTGTCOG GAGTGTGGGC CAGGCTG AGT 1680 
GCATAGGTGA CAGGGCCGTG AGCATGGGCC TGGGTGTGTG TGAGCTCAGG CCTAGGTGCG 1740 
CAGTGTGGAG ACGGGTGTTO TCGGGGAAGA GGTGTGGCTT CAAAGTGTGT GTGTGCAGGG 1800 

15 GGTGGGTGTG TTAGCGTGGG TTAGGGGAAC GTGTGTGCGC GTGCTGGTGG GCATGTGAGA 1860 
TGAGTGACTG CCGGTGAATG TGTCCACAGT TGAG AGGTTG GAGCAGGATG AGGGAATCCT 1920 
GTCACCATCA ATAATCACTT GTGGAGCGCC AGCTCTGCCC AAGGCGCCAC CTGG GCGGA C 1980 
AGCCAGGAGC TCTCCATGGC CAGGCTGCCT GTGTGCATGT TOCCTGTCTG GTGCCOCTTT 2040 
GCCCGCCTCC TGCAAACCTC ACAGGGTOOC CACACAACAG TGCCCTCCAG AAOCAOCCOC 2100 

20 TCGGAGGCAG AGGAAGGAAA ATGGGGATGG CTGGGGCTCT CTCCATCCTC ITITI CTCC T 2160 
TGCCTTCGCA TGGCTGGCCT TCCCCTCCAA AACCTCCATT CUXIUCTGC CAGCCCCITT 2220 
GCCATAGCCT GATTTTGGGG AGGAGGAAGG GGCGATTTGA GGGAGAAGGG GAGAAAGCTT 2280 
ATGGCTGGGT CRM 1 1 1U1 1 CCCTTCCCAG AGGGTCTTAC TGTTOCAGGG TGOCCCCAGO 2340 
OCAGGCAGGG O CCACACTAT GCCTGCGCCC TCGTAAAGGT GACCCCTGCC AT TTACC AGC 2400 

25 AGOCCTGGCA TGTTOCTGCC CCAGAOGAAT AGAATGGAGG GAGCTCCAGA AACT TTCCAT 2460 
CCCAAAGGCA UTCTOOGTGG TTGAAGCAGA CTGGATTTTT GC1C1GCCCC TGACCCCTTO 2520 
TCCCTCTTTG AGGGAGGGGA GCTATGCTAG GACTCCAACC 1CAGGGACTC GGGTGGCCTG 2580 
CGCTAGCTTC TTTTGATACT GAAAACTTTT AAGGTGGGAG GGTGGCAAGG GATGTGCTTA 2640 
ATAAATCAAT TCCAAGCCTC AAAAAAAAAA AAAAAAAAAA AA 

Sea ID NO: 50 Protein sequence 
Protein Accession ft NP_219487.1 

1 U 21 31 41 51 

MKDVGPBSOG QCTPCWPSPA LESVLGKASQ HLGLESGQPL YLLELNWGGT ECALSSTGRT 60 
AAOFLPISLL PTSPAAWLGP BALCLPGKPG TTGLRDTGGP LLLPPPTLLQ DTTRWCWMLV 120 
LWPAKVHGDS PHGILRDQAA G2GKEFHPDH CPSQYPRRPH HTPFQGOGSS KPRARILCCC 180 
LVESLPPCVG SVGQAEOGD RAVSMGLGVC ELRPRCAVWR RVLSGKRCGF KVCVCRGWVC 



Seq ID Nth 51 DNA Beqwmrr 

Nucleic Acid Accession ft XMJ&59098.1 

Coding sequence: 1 78-61 SQmderiined scqncnces correspond to start and stop codoas) 

45 

1 11 21 31 41 51 

GATGTACACT CTGAAGTG AG CACATTCCTG TTGGCAGGAC ATG ACACCTT GGCAGCAAGC 60 

ATCTCCTGGA TCCTTTACTG CCTGGCTCTG AAOCCTGAGC ATCAAGAGAG ATGCCGGGAG 120 
50 GAGGTCAGGG GCATCCTGGG GGATGGGTCT TCTATCACTT GGGAOCAGCT GGOTGA OATG 180 

TXXjTACACCA CAATGTGCAT CAAGGAGACG TGCCGATTCA TTCCTGCAGT CCCGTCCATT 240 

TCCAGAGATC TCAGCAAGCC ACTTACCTTC CCAGATGGAT GCACATTGCC TGCAGGGATC 300 

ACCGTGGTTC TTAGTATTTG GGGTCTTCAC CACAACCCTG CTGTCTGGAA AAACCCAAAG 360 

GTCTTTGACC CCTTGAGGTT CTCTCAGGAG AATTCTGATC AGAGACACCC CTATGCCTAC 420 
55 TTACCATTCT CAGCTGGATC AAGGAACTGC ATTGGGCAGG AGTTTGCCAT GATTGAGTTA 480 

AAGGTAACCA TTGCCTTGAT TCTGCTCCAC TTCAGAGTGA CTCCAGACOC CAOCAGGOCT 540 

CITACTTTOC CCAAOCATTT TATOCTCAAO COCAAOAATG GGAT OTATIT GCAOCT GAAG 600 
. AAACTCTCTQ AATOTTAGAT CTCAGGGT AC AATGATTAAA OGTACTl 1U1 1 1 1 1CGAAGT 660 
„ TAAATTTACA GCTAATGATC CAAGCAGATA GAAAGGGATC AATGTATGGT GGGAGGATTG 720 
60 GAGGTTGGTC GOATAGGGGT CTCTGTGAAO AGATCCAAAA TCATTTCTAG GTACACAGTO 780 

TCTCAGCTAO AlClUl 1 IVY ATATAACTTT GGOAGATTTT CAGATCTTTT CTGTTAAA CT 840 

TTCACTACTA TTAATOCTGT ATACACCAAT AG ACTTTCAT ATATTTTCTG Ill/IUUAA 900 

AATAGTTTTC AGAATTATGC AAGTAATAAG TGCATGTATG CTCACTGTCA AAAATTCCCA 960 

ACACT AG AAA ATCATGTAGA ATAAAAATTT TAAATCTCAC TTCACTTAGC CGA CATTC CA 1020 
65 TGCCCTGACC AATOCTACTO CTTTTCCTAA AAACAGAATA ATTTOGTOTG CATTCTTTCA 1080 

GACTTTTTCC TATACATTTT ATATGTAGAA ATGTAGCAAT GTATTTOTAT AG ATOTOATC 1140 

ATTCCTATAT TGTTATTGAT 111 1 1 1C ACT TAATAAAAAT TCACCTTATT CCTT 

S^FI)NC* 52 Protein scqcencg: 
70 Protein Accession^ XP_059098.1 

f 

1 11 21 31 41 51 
I I I I I I 

MSYTTMCIKE TCRLIPAVPS ISRDLSKPLT FPDGCTLPAG ITWLSIWGL HHNPAVWKNP 60 
75 KVFDPLRFSQ ENSDQRHFYA YLPFSAGSRN OGQEPAMIE LKVTIALILL HFRVTPDPTR 120 
PLTFPNHFIL KPKNGMYLHL KXLSEC 
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Seq ID NO: 53 DNA sequence 

Nucleic Acid Accession $: NMJD30916 

Coding sequence: 1-1533 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 . 51 

ATGCCCCTGT COCTTK3GAGC COAGATGTGO GGGCCTG AGG CCTGOCTGCT GCTGCTGCTA 60 
CTGCTGGCAT CATTTACAGG CCGGTGCCCC GCGGGTGAGC TGGAGACCTC AGACGTGGTA 120 
ACTGTGGTGC TGGGCCAGGA OGCAAAACTG CCCTGCTTCT ACCGAGGGGA CTCCGGCGAG 180 
CAAGTGGGGC AAGTGGCATG GGCTCGGGTG GACGCGGGCO AAGGCGCCCA GGAACTAGCG 240 
CT ACTGCACT CCAAAT ACGG GCTTCATGTG AGCCOGGCTT ACGAGGGCCG CGTGGAGCAG 300 
CCGCCGCCCC CACGCAACCC CCTGGACGGC TCAGTGCTCC TGCGCAACGC AGTGCAGGCG 360 
GATGAGGGCG AGTACGAGTG COGGGTCAGC AOCTTOCOCG CCGGCAGCTT CCAGGCGCGG 420 
CTGCGGCTCC GAGTGCTGGT GCCTCCCCTG CCCTCACTGA ATCCTGGTCC AGCACTAGAA 480 
GAGGGCCAGG GCCTGACCCT GGCAGCCTCC TGCACAGCTG AGGGCAGCCC AGCCCCCAGC 540 
GTGACCTGGG ACACGGAGGT CAAAGGCACA ACGTCCAGCC GTTCCTrCAA GCACTCCCGC 600 
TCTGCTGCCG TCACCTCAGA GTTCCACTTG GTGCCTAGCC GCAGCATQAA TGGGCAGCCA 660 
CTGACTTGTG TGGTGTCCCA TCCTGGCCTG CTCCAGGAOC AAAGG ATCAC CCACATCCTC 720 
CACGTGTCCT TCCTTGCTGA GGCCTCTGTG AGGGGCCTTG AAGACCAAAA TCTGTGGCAC 780 
ATTGGCAGAG AAGGAGCTAT GCTCAAGTGC CTGAGTGAAG GGCAGCCCCC TCCCTCATAC 840 
AACTGGACAC GGCTGGATGG GCCTCTGCCC AGTGGGOTAC GAGTGGATGG GGACACTTTO 900 
GGCTTTCCCC CACTGACCAC TGAGCACAGC GGCATCTACG TCTGCCATGT CAGCAATGAG 960 
TTCTCCTCAA GGGATTCTCA GGTCACTGTG GATGTTCTTG ACCCCCAGGA AGACTCTGGG 1020 
AAGCAGGTGG ACCTAGTGTC AGOCTCQGTG GTGGTGGTGG GTOTGATCGC CGCACTCTTG 1080 
TTCTGCCTTC TGGTGGTGGT OGTGGTGCTC ATGTCCCGAT ACCATOGGCG CAAGGCOCAG 1140 
CAGATGACCC AGAAATATGA GGAGGAGCTG ACCCTGACCA GGGAGAACTC CATCCGGAGG 1200 
CTGCATTCCC ATCACACGGA CCCCAGGAGC CAGCCGGAGG AGAGTGTAGG GCTGAGAGCC 1260 
G AGGGOCACC CTGATAGTCT CAAGGACAAC AGTAGCTGCT CTGTGATGAG TGAAGAGCCC 1320 
G AGGGCCGCA GTTACTCCAC GCTGACCACG GTGAGGGAGA TAGAAACACA GACTGAACTG 1380 
CTGTCTCCAG GCTCTGGGCG GGCCGAGGAG GAGGAAGATC AGGATGAAGG CATCAAACAG 1440 
GCCATGAACC ATTTTGTTCA GGAGAATGGG ACCCTACGGG CCAAGCCCAC GGGCAATGGC 1500 
ATCTACATCA ATGGGCGGGG ACACCTGGTC TGA 

Seq ID NO: 54 Protein sequence: 
Protein Accession ft NPJ 121 78.1 

] II 21 31 41 51 

MPLSLGAEMW GPEAWTLLUX LLASFTGRCP AGELBTSD W TWLGQDAKJL PCFYRGDSGE 60 
QVGQVAWARV DAGEGAQELA LLHSKYGLHV SPAYEGRVEQ PPPFRNPLDG SVLLRNAVQA 120 
DBGEYECRVS TFPAQSFQAR LRLRVLVPPL PSLNPGPAUB BOQOLTLAAS CTAEOSPAPS ISO 
VTWDTEVKGT TSSRSFKHSR SAAVTSEFHL VPSRSMNGQP LTCWSHPGL LQDQRITHIL 240 
HVSFLAEASV RGLEDQNLWH IGRBGAMLKC LSEGQPPPSY NWTRLDGPLP SG VRVDGDTL 300 
GFPPLTTEHS GIYVCHVSNE FSSRDSQVTV DVLDPQHDSG KQVDLVSASV VWGVIAALL 360 
FCLLVWWL MSRYHRRKAQ QMTQKYEEEL TLTRENSIRR LHSHHTDPRS QPHESVGLRA 420 
EGHPDSLKDN SSCSVMSEEP EGRSVSTLTT VRElKlt/lhL LSPGSGRAEE EEDQDEGIKQ 480 
AMNHFVQHNG TLRAKPTGNG IYINGRGHLV 

Seq ID NO: 55 DNA sequence 

Nucleic Acid Accession #: AF007170.1 

Coding sequence: 73-1725 (underhned sequences c<mes|)ond to stait and stop codons) 
1 11 21 31 41 51 

AAGGAGGCGG CCTCXX5GGAA AAGCGACCGC AGGACTCCTG AGAGCAGCCT CCATGAGGCC 60 
CTGGACCAGT GCATGACCGC CC^GGACCTC TTCCTCACCA ACCAGTTCTC AGAAGCi\CTC 120 
AGCTACCTCA AGCCCAGAAC CAAGGAAAGC ATGTACCACT CACTGACATA TGCCACCATC 180 
CTGGAGATGC AGGCCATGAT GACCTTTGAC CCTCAGG ACA TCCTGCTTGC CGGCAACATG 240 
ATGAAGGAGG CACAGATGCT GTGTCAGAGG CACCGGAGGA AGTCTTCTGT AACAGATTCC 300 
TTCAGCAGCC TGGTGAACCG CCCCACGCTG GGCCAATTCA CTGAAGAAGA AATCCACGCT 360 
GAGGTCTGCT ATGCAGAGTG CCIGCTGCAG CGAGCAGCCC TGACCTTCCT GCAGGACGAG 420 
AACATGGTGA GCTTCATCAA AGGCGGCATC AAAGTTCGAA ACAGCTACCA GACCT ACAAG 480 
GAGCTGGACA GCCTTGTTCA GTCCTCACAA TACTGCAAGG GTGAOAACCA C CCGC ACTTT 540 
GAAGGAGGAG TGAAGCTTOO TGTAGGGGCC TTCAACCTGA CACTGTCCAT GCTICCTACT 600 
AGGATCCTGA GGCTGTTGOA lil 1 f G TGGCG IM IUAGGAA ACAAGGACTA TGGGCTGCTG 660 
CAGCTGGAGG AGOG AGCGTC AGGGCACAGC TTCCGCTCTG TGCTCTGTGT CATGCTCCTG 720 
CTGTGCTACC ACACCITCCT CAOCTTCGTG CTCGGTACTG GGAACGTCAA CATCG AGGAG 780 
GCCGAGAAGC TCTTG AAGCC CTACCTGAAC CGGTACCCTA AGGGTGCCAT CTTCCTGTTC 840 
TTTGCAGGGA GGATTGAAGT CATTAAAGGC AACATTGATG CAGCCATCCG GCGTTTCGAG 900 
GAGTGCTGTG AGGCCCAGCA GCACTGGAAG CAGTTTCACC ACATGTGCTA CTGGGAGCTG 960 
ATGTGGTGCT TCACCTACAA GGGCCAGTGG AAGATGTCCT ACTTCTACGC CGACCTGCTC 1020 
AGCAAGGAOA ACTGCTGGTC CAAGOCCACC TACATTTACA TG AAGGCCGC CTACCTCAGC 10B0 
ATOTTTOGOA AOOAOOACCA CAAGCCGTTCGGGGACGACG AAGTGGAATT ATTTCGAGCT 1140 
GTGCCAGGCC TGAAGCTCAA GATTGCTGGG AAATCTCTAC CCACAGAGAA GTTTGCCATC 1200 
CGGAAGTCCC GGCGCTACTT CICCTCCAAC CCTATCTOGC TGCCAGTGCC TGCTCTGG AA 1260 
ATGATGTACA TCTGGAACGG CTACGCCGTG ATTGGGAAGC AGCCGAAACT CACGGATGGG 1320 
ATACTTGAGA TTATCACTAA GGCTGAAGAG ATGCTGG AGA AAGGCCCAGA GAACGAGTAC 1380 
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TCAOTOGATG ACOAGTGCTT GGTG AAATTO TTOAAAGGCC TOTGTCTGAA ATACCTGGGC 1440 
CGTGTCCAGG AGGCCGAGGA GAATTTTAGG AGCATCTCTG CCAATGAAAA GAAGATTAAA 1500 
TATGACCACT ACTTG ATCCC AAACGCCCTG CTGG AGCTGG CCCTGCTGCT TATGGAGCAA 1560 
GACAGAAACG AAGAGGCCAT CAAACTTTTG GAATCTGCCA AGCAAAACTA CAAGAATTAC 1620 
TCCATGGAGT CAAGGACACA CTTTCGAATC CAGGCAGCCA CACTCCAAGC CAAGTCTTCC 1680 
CTAG AGAACA GCAGCAG ATC CATGGTCTCA TCAGTGTCCT TGTAGCTTTG TGCAGCAGTT 1740 
CCGGGCTGGA AGACAGAGAC AGCTGGACAG AGCTCCTGAA AACATTTCAA AATACCCCCT 1800 
CCCCCTGCCC TGCCCTOCCT TTOOGGTCCA CCGGCACTOC AOTTGGATGG CACAACATAQ 1860 
TGTATCCGTG CAGAAGCCG A GCTGGCATTT TCACCAGTGT AGOCAAGGGC CTTTGCCAAG 1920 
GGCAGAGCAG GTGGAGCCCT CTGCCTGCCC TATCACACAT AOGGGTACTT GCTTTTCACT 1980 
GTGATGTTTA AGAGAATGTA TGAACAGTTT ACATnTCCT TAGAAATACA TTGATGGGAT 2040 
CACAGTTGGC TTTAAAAACC AACAACAATC AACCACCTGT AAGTCTTTGT CTTCACCTAT 2100 
TATCATCTGG AGGTAAATCT CTTTATATGA TG ATGCCAAA GGGCAAATTG CTTTTCAAAT 2160 
TCAGCAAGTT CTCAGCTTGT GTGACGGAAG OTCCTTCAGA GGACCTGAGG AATGCCTGGG 2220 
AG AGGCTAAG CCTCAGGCTT CAATGCTTCT GOGGTTGGGC ATGAGGATGT ACACAGACAC 2280 
CCACTACCTT ACT ACTCACA CITCATTTCA CTCCTTTTGT AAATTTCCAA TTTAAAAATC 2340 
AAGCACGTCT TTTTAGTGAG ATAAAATCTG AGCTCTTCTQ TAGAAAAATC AATCTCTACC 2400 
AGTAGAAAAT GCCAGGGCTT GATGGAAGAG CTGTGTAGOC CTTTCTATGC CAAAGOCAGG 2460 
AAATTTGGGG GGCAGGAGGA GGTTCTCAGA ATCCAGTCTG TATCTTTGCT GTATGCCAAA 2520 
CTGAAACCAC TGGGAATAAT TTATGAAACA TAAAAATCTT CTGTACTTCA CTCCAAGGTA 2580 
CATITGCITA CTGACAGCAT TTTTGTTAAA ACTGTTATTC TTGAAAAAAA AAAAAAAAAA 2640 
AA 

Sea ID NO: 56 Protein sequence: 
Protein Accession ft AAC39582.1 

1 11 21 31 41 51 

MTAli>LFLTN QFSEALSYLK PRTKESMYHS LTYATILEMQ AMMTFDPQDI LLAGNMMKEA 60 
QMLCQRHRRK SSVTDSFSSL VNRPTLGQFT EEECHAEVCY AECLLQRAAL TFLQDENMVS 120 
FDCGGDCVRN SYOTYKELDS LVQSSQYCKG ENHPHFEGGV KLGVGAFNLT LSMLPTCULR 180 
LLEFVGFSGN KDYGLLQLEE GASGHSFRSV LCVMLLLCYH TFLTFVLGTG NVNIEEAEKJL 240 
LKPYLNRYPK GAIFLFFAGR IEV1KGNIDA AIRRFEECCB AQQHWKQFHH MCYWHLMWCF 300 
TYKGQWKMSY FYADLLSKEN CWSKATYIYM KAAYLSMFGK EDHKPFG DDE VELFRAVPGL 360 
KLK1AGKSLP TEKFAJRKSR RYFSSNPISL PVPALEMMYI WNGYAVK3CQ PKLTDGILEI 420 
ITKAEEMLEK GPENEYSVDD ECLVKIXKGL CLKYLGRVQB ABENFRSISA NEKKDCYDHY 480 
LIPNALLELA LLLMEODRNE EADOXESAK QNYKNYSMES RTHFRIOAAT LQAKSSLENS 540 
SRSMVSSVSL 

Seq ID NO: 57 DNA sequence 

Nacktc Add Accession «: NM 006670.1 

rVuHng gegi te nne; 1 J927 (imrferirocd sequences cuiiespond to start and stopcodons) 
1 11 21 31 41 51 

ATGCCTOGGG GGTGCTCCCG GGGCCOCGOC GCCGGGG ACG GOCGTCTGCG GCTGGCGCGA 60 
CT AGCGCTGG TACTCCTGGG CTGGGTCTCC TCGTCTTCTC CCACCTCCTC GGCATCCTCC 120 
TTCTCCTCCT CGGCGCCGTT CCrGGCTTOC GCCGTGTCCG CCCAGCCCCC GCTGCCGG AC 180 
CAGTGCCCCG CGCTGTGCGA GTGCTCCGAa GCAGCGCGCA CAGTCAAGTG CGTTAACCGC 240 
AATCTGACCG AGGTGCCCAC GGACCTGCOC GCCTACGTGC GCAACCTCTT CCTTACOGGC 300 
AACCAGCTGG CCAGCAACCA CTTCCTTTAC CTGCCGCGGG ATGTGCTGGC CCAACTGCCC 360 
AGCCTCAGGC ACCTGGACTT AAGTAATAAT TCGCTGGTGA GCCTGACCTA CGTGTCCTTC 420 
CGCAACCTGA CACATCTAOA AAGCCTCCAC CTGGAGGACA ATGCCCTCAA GGTCCTTCAC 480 
AATGGCACCC TGGCTGAGTT GCAAGGTCTA CCCCACATTA GGGTTTTCCT GGACAACAAT 540 
COCIGGGTCT GCGACTGCCA CATGGCAOAC ATGGTGACCT GGCTCAAGGA AACA GAGGTA 600 
GTGCAGGGCA AAGACCGGCT CACCTOTGCA TATCCGGAAA AAATGAGGAA TCGGOTCCTC 660 
TTGGAACTCA ACAGTGCTGA CCTGGACTGT GACCOGATTC TTCCCCCATC CCTGCAAACC 720 
TCTTATOTCT TCCTGGGTAT TGTTTTAGOC CTGATAGGCG CTATTTTOCT CCIGGTTTTG 780 
TATTTGAACC GCAAGGGGAT AAAAAAGTGG ATGCATAACA TCAGAGATGC CTGCAGGGAT 840 
CACATGGAAG GGTATCATTA CAGATATG AA ATCAATOCOG ACCCCAGATT AACAAACCTC 900 
AGTTCTAACT CGG ATGTCCT CGA GTOA 

Seq ID NO: 58 Protein sequence 
Proton Accession #: NP.006661.1 



1 11 21 31 41 51 
t I I 1 t t 

MPGGCSRGPA AGDGRLRLAR LALVLUGWVS SSSCTSSASS FSSSAPFLAS AVSAQPPLPD 60 
QCPALCECSE AARTVKCVNR NLTEVFTDLP AYVRNLFLTG NQLASNHFLY LPRDVLAQLP 120 
SLRHLDLSNN SLVSLTYVSF RNLTHLESLH LEDNALKVLH NGTLAELQGL PHIRVFLDNN 180 
PWVCDCHMAD MVTWLKETBV VQGKDRLTCA YPEKMRNRVL LBLNSADLDC DPXLPPSLQT 240 
SYVFLG1VLA UG AJFLLVL YLKRKGXKKW MHNTRDACRD HMEGYHYRYB INADPRLTNL 300 
SSNSDVLE 

Seq ID NO: 59 DNA sequence 

Nucleic Acid Accession & NM 024022 

372 



WO 02/059377 



Coding sequence: 1-1 362(mideiimed sequences correspond to start and stop codons) 
1 11 21 31 41 51 

ATGOGGGAAA ATGATOCOOC TOCTOTTOAA QCCCCCTTCT CAT TCCOA TC GCTTTTTGOC 60 
CTTGATGATT TOAAAATAAG TCCTGTTGCA CCAGATGCAG ATGCTGTTGC TGCACAGATC 120 
CTGTCACTGC TGCCATTGAA GTTTTTTCCA ATCATCGTCA TTGGGATCAT TGCATTGATA 180 
TTAGCACTGG CCATTGGTCT GGGCATCCAC TTCGACTGCT CAGGG AAGTA CAGATGTCGC 240 
TCATCCTTTA AGTGTATCGA GCTGATAGCT CGATGTGACG GAGTCTCGGA TTQCAAA GAC 300 
GGGGAGGACG AGTAOCGCTG TGTCCGGGTG GGTGGTCAGA ATGCCGTGCT CCAGGTGTTC 360 
ACAGCTGCTT CGTGGAAGAC CATGTGCTCC GATGACTGGA AGGGTCACTA CGCAAATGTT 420 
GCCTGTGCCC AACrGGGnT CCCAAGCTAT GTGAGTTCAG ATAACCTCAG AGTGAGCTCG 4S0 
CTGGAGGGGC AGTTCCGGGA GGAGTTTGTG TCCATCG ATC ACCTCTTGCC AGATGACAAG 540 
GTGACTGCAT TACACCACTC AGTATATGTG AGGGAGGGAT GTGCCTCTGG OCAOGTGGTT 600 
ACCTTGCAGT GCACAGCCTG TGGTCATAGA AGGGGCTACA GCTCACGCAT CGTGGGTGGA 660 
AACATGTCCT TGCTCTCGCA GTGGCCCTGG CAGGCCAGCC TTCAGTTCCA GGGCTACCAC 720 
CTGTGCGGGG GCTCTGTCAT CACGCCCCTG TGGATCATCA CTGCTGCACA CTGTGTTTAT 780 
GACTTGTACC TCCCCAAGTC ATGGACCATC CAGGTGGGTC TAGTTTCCCT GTTGGACAAT 840 
CCAGCCCCAT COCACTTGGT GGAGAAGATT GTCTACCACA GCAAGTACAA GOCAAAGAGG 900 
CTGGGCAATG ACATCGCCCT TATGAAGCTO GCCGGGCCAC TCACOTTCAA TGAAATGATC 960 
CAGCCTGTGT GCCTGCCCAA CTCTGAAGAG AACTTCCCCG ATGGAAAAGT GTGCTGGACG 1020 
TCAGGATGGG GGGCCACAGA GGATGGAGGT GACGCCTCCC CTGTCCTGAA CCACGCGGOC 1080 
GTCCCTTTGA TTTCCAACAA GATCTGCAAC CACAGGGACG TGTACGGTGG CATCATCTCC 1 140 
CCCTCCATGC TCTGCGCGGG CTACCTGACG GGTGGCGTGG ACAGCTGCCA GGGGGACAGC 1200 
GGGGGGCCCC TGGTGTGTCA AGAGAGGAGG CTGTGGAAGT TAGTGGGAGC GACCAGCTTT 1260 
GGCATCGGCT GCGCAGAGGT GAACAAGCCT GGGGTOTACA CCCGTGTCAC CTCCTTCCTG 1320 
GACTGGATCC ACQ AGCAGAT GGAG AGAGAC CTAAAAAC CTGA 

SeqIDNQ: 60 Protein sequence 
Proirin Accession #: NP 076927 

1 11 21 31 41 51 

MGENDPPAVB APFSFRSLFG LDDLKISPVA PDADAVAAQI LSL1PLKFFP IIVIGIIALI 60 
LALAIGLGIH FDCSGKYRCR SSFKCTEUA RCDGVSDCKD GEDEYRCVRV GGQNAVLQVF 120 
TAASWKTMCS DDWKGHYANV ACAQLGFPSY VSSDNLRVSS LEGQFREEFV SIDHLLPDDK 180 
VTALHHSVYV RBGCASGHW TLQCTACGHR RGYSSRIVGG NMSLLSQWPW QASLQFQGYH 240 
LCGGSVITPL WDTAAHCVY DLYLPKSWT1 QVGLVSLLDN PAPSHLVBKI VYHSKYKPKR 300 
LGNDiALMKL AGPLTFNEMI QPVCLPNSEB NFPDGKVCWT SGWGATHDGG DASPVLNHAA 360 
VPLISNKICN HRDVYGGHS PSMLCAGYLT GGVDSCQGDS GGFLVCQERR LWKLVGATSF 420 
G1GCAEVNKP GVYTRVTSFL DWIHEQMERD LKT 

Seq IDNOt 61 DNA sequence 

Niiclek Acid Accession #: NM_006475 

Coding srqurorrr 28-2538 (underlined se qu en c es correspond to start and stop codons) 
I 11 21 31 41 51 

AACAGAACTG CAACGGAG AG ACTCAAGATG ATTCCCTTTT TACCCATGTT TTCTCTACTA 60 
TTGCTGCTTA TTGTTAAOCC TATAAACGCC AACAATCATT ATG ACAAGAT CTT OGCT CAT 120 
AGTCGTATCA GGGGTCGGGA CCAAGGCCCA AATGTCTGTG CCCTTCAACA GATTTTGGGC 180 
ACCAAAAAGA AATACTTCAG CACTTGTAAG AACTGGTATA AAAAGTCCAT CTGTGGACAG 240 
AAAACGACTG TTITATATGA ATGTTGCCCT GGTTATATGA GAATGGAAOG AATGAAAGGC 300 
TGCCCAGCAO TTTTGCCCAT TGAOCATOTT TATGGCACTC TGGGCATCGT GGGAGGCAOC 360 
ACAACGCAGC GCTATTCTGA CGCCTCAAAA CTGAGGGAGG AGATCGAGGG AAAGGGATCC 420 
TTCACTTACT TTGCACCGAG TAATGAGGCT TGGGACAACT TGGATTCTGA TATCCGTAGA 480 
GGTTTGOAGA GCAACOTGAA TGTTGAATTA CTGAATGCTT TACATAGT CA CATOATTAAT 540 
AAGAG AATGT TGACCAAGG A CTTAAAAAAT GGCATGATTA TTOCTTCAAT GTATAACAAT 600 
TTGGGGCTTT TCATTAACCA TTATOCTAAT GGGGTTGTCA CTGTTAATTG TGCTCGAATC 660 
ATCCATGGGA ACCAGATTGC AACAAATGGT GTTGTCCATG TCATTQACCG TGTCCTTACA 720 
CAAATTOGTA CCTCAATTCA AGACTTCATT GAAGCAGAAG ATGACCTTTC ATCTTTTAGA 780 
GCAGCTGCCA TCACATCGGA CATATTGGAG GCCCTTGGAA GAGACGGTCA CTTCACACTC 840 
T1TGCTCCCA CCAATG AGGC TTTTOAGAAA CTTCCACOAO OTOTCCTAGA AAGOTTCATQ 900 
GGAGACAAAG TGGCTTCCGA AGCTCTTATG AAGTACCACA TCTTAAATAC TCTCCAGTGT 960 
TCTGAGTCTA TTATGGGAGG AGCAGTCTIT GAGACGCTGG AAGGAAATAC AATTGAGATA 1020 
GGATGTG AOG GTGACAGTAT AACAGTAAAT GO AATCAAAA TGGTQAACAA AAAGG ATATT 1080 
GTOACAAATA ATGGTGTGAT CCATTTOATT QATCAGOTCC TAATTCCTGA TTCTGCCAAA 1140 
CAAGTTATTG AGCTGGCTGG AAAACAGCAA ACCACCTTCA CXKJATCTTGT GGOCGAATTA 1200 
GGCTTGGCAT CTGCTCTGAG GCCAGATGGA GAATACACTT TGCTGGCACC TGTGAATAAT 1260 
GCATTTTCTG ATG ATACTCT CAGCATGGTT CAGCGCCTCC TTAAATTAAT TCTGCAGAAT 1320 
CACATATTGA AAGTAAAAGT TGGCCTTAAT GAGCTTTACA ACGGGCAAAT ACTGGAAACC 1380 
ATCGGAGGCA AACAGCTCAG AGTCTTCGTA TATCGTACAG CTOTCTGCAT TGAAAATTCA 1440 
TGCATGGAG A AAGGGAGTAA GCAAGGGAGA AACGGTGCGA TTCACATATT OOGCGAGATC 1500 
ATCAAGCCAG CAGAGAAATC CCTCCATGAA AAGTTAAAAC AAGATAAGCG CTTTAGCACC 1560 
TTCCTCAGCC TACTTGAAGC TGCAGACTTG AAAGAGCTCC TGACACAACC TGGAG ACTGG 1620 
ACATTATTTG TGCCAACCAA TGATGCTTTT AAGGGAATGA CTAGT0 AAGA AAAAGAAATT 1680 
CTGATACXK3G ACAAAAATGC TCTTCAAAAC ATCATTCTTT ATCACCTGAC ACCAGG AGTT 1740 
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TTCATTGGAA AAGGATTTGA ACCTGGTGTT ACTAACATTT TAAAGACCAC ACAAGG AAGC 1800 
AAAATCTTTC TGAAAG AAGT AAATOATACA CTTCTGGTGA ATG AATTGAA ATCAAAAG AA I860 
TCTGACATGA TGACAACAAA TGGTGTAATT CATGTTGTAG ATAAACTCCT CTATCCAGCA 1920 
GACACACCTG TTGGAAATGA TCAACTGCTG GAAATACTTA ATAAATTAAT CAAATACATC 1980 
CAAATTAAGT TTGTTCGTGG TAGCACCTTC AAAGAAATCC CCGTGACTGT CTATACAACT 2040 
AAAATTATAA CCAAAGTTGT GGAACCAAAA ATTAAAGTGA TTGAAGGCAG TCTTCAGCCT 2100 
ATTATCAAAA CTGAAGGACC CACACTAACA AAAGTCAAAA TTGAAGGTGA ACCTGAATTC 2160 
AGACTGATTA AAGAAGGTGA AACAATAACT GAAGTGATOC ATGGAQAGCC AATTATTAAA 2220 
AAATACACCA AAATCATTGA TGG AGTGOCT GTGG AAATAA CTGAAAAAGA GACACGAGAA 2280 
GAACGAATCA TTACAGGTCC TG AAATAAAA TACACTAGGA TTTCTACTGO AGGTGGAGAA 2340 
ACAGAAGAAA CTCTGAAGAA ATTGTTACAA GAAGAGGTCA CCAAGGTCAC CAAATTCATT 2400 
GAAGGTGOTG ATGGTCATTT ATTTGAAGAT GAAGAAATTA AAAGACTGCT TCAGGGAGAC 2460 
ACACCCGTGA GGAAGTTGCA AGCCAACAAA AAAGTTCAAG GTTCTAGAAG ACGATTAAGG 2520 
GAAGGTCGTT CTCAGTgAAA ATCCAAAAAC CAGAAAAAAA TGTTTAT ACA ACCCTAAGTC 2580 
AATAACCTGA CCTTAGAAAA TTGTGAGAGC CAAGTTG ACT TCAGGAACTG AAACATCAGC 2640 
ACAAAGAAGC AATCATCAAA TAATTCTGAA CACAAATTTA ATATTTTTTT TTCTGAATGA 2700 
GAAACATGAG GGAAATTGTG GAGTTAOCCT OCTGTGGTAA AGG AATTGAA GAAAATATAA 2760 
CACCTTACAC CCTTTTTCAT CTTGACATTA AAAGTTCTGG CTAACTTTGG AATCCATTAG 2820 
AG AAAAATCC TTGTCACCAG ATTCATTACA ATTCAAATCG AAGAGTTGTG AACTGTTATC 2880 
CCATTGAAAA GACCGAGCCT TGTATGTATG TTATGGATAC ATAAAATGCA C GCAAGC CAT 2940 
TATCTCTCCA TGOGAAOCTA AGTTATAAAA ATAGGTGCTT GGTGTACAAA AC TTTTTAT A 3000 
TCAAAACGCT TTGCACATTT CTATATGAGT GGGTTTACTG GTAAATTATO TTATTTTTTA 3060 
CAACTAATTT TGTACTCTCA GAATGTTTGT CATATGCTTC TTGCAATGCA TATTTTTTAA 3120 
TCTCAAACGT TTCAATAAAA CCATTTTTCA GATATAAAGA GAATTACTTC AAATTGAGTA 31 80 
ATTCAGAAAA ACTCAAGATT TAAGTTAAAA AGTGGTTTGG ACTTGGGAA 

Sea ID NO: 62 Protein sequence: 
Protein Accession #: NP 006466 

1 11 21 31 41 51 

IflPnlpMpi 1XIXTVNPIN ANNHYDKILA HSRIRGRDQG PNVCALQQIL GTKKKYFSTC 60 
KNWYKKS1CG QKTTVLYECC PGYMRMEGMK GCPAVLPIDH VYGTLG1VGA TTTQRYSDAS 120 
KLREEIEGKG SFTYFAPSNE AWDNLDSDIR RGLESNVNVE LLNALHSHMI NKRMLTKDLK. 180 
NGMHPSMYN NLGLFINHYP NG WTVNCAR DHGNQ1ATN GWHVIDRVL TQIGTSIQDF 240 
IEAEDDLSSF RAAAITSDIL EALGRDGHFT LFAPTNEAFE KLFRGVLERF MGDKVASBAL 300 
MKYHILNTLQ CSESIMGGAV FETLEGNTTH IGCDGDSTTV NGIKMYNKKD IVTNNGVIHL 360 
IDQVUPDSA KQYBBLAGKQ QTTFTDLVAQ LGLASALRPD GEYTLLAPVN NAFSDDTLSM 420 
VQRLLKULQ NHUKVKVGL NELYNGQILE TIGGKQLRVF VYRTAVOEN SCMEKGSKQG 480 
KNGAIHIFRE HKPAEKSLH EKLKQDKRFS TFLSLLEAAD LKELLTQPGD WTXFVFTNDA 540 
FKGMTSBEKB ILIRDKNALQ NIILYHLTPG VFIGKGFEPG VTNIUCTTQG SKJFLKEVND 600 
TLLVNELKSK ESDIMTTNG V MWDKLLYP APTFVGNDQL LHLNKUK Y IQIKFVR GST 660 
FKB1PVTVYT TKHTKWEP KJKWTEGSLQ PUKTEGPTL TKVJQEGEF E JKLiKh Utili 720 
TEVIHGHPn KKYTKIIDGV PVETTEKETR EERITTGPEI KYTRl^rGGG ETEETLKKLL 780 ; 
QEEVTKVTKF IEGGDGHLFE DEEKRLLQG DTPVRKIjQAN KKVQGSRRRLREGRSQ 

Seq ID NO: 63 DNA sequence 

Hui^Ackl Accession NM_020974 . 

Caih^seqiicnce: 81-3080 (imdeiiiDed seqneni»cca7espondtostmaiHistopcodons) k 

1 11 21 31 41 51 

GGCGTCCGCG CACACCTCOC CGCGOCGCCG CCGCCACCGC CCGCACTCCG CCGCCTCTGC 60 
COGCAACCGC TGAGCCATCC ATGGGGGTCG CGGGCCGCAA COGTCCCGGG GCGGOCTGGG 120 

cxxnxjcnxKrrGcrocTOCTG cro<nxxx^ cactgctgct gctggcgggg gccgtcccgc iso 

CGGGTCGGGG COGTGCCGCG GGGOCGCAGG AGGATGTAGA TGAGTGTGOC CAAGGGCTAG 240 
ATG ACTGCCA TGCCGACGCC CTGTGTCAGA ACACACCCAC CTCCTACAAG TGCTCCTGCA 300 
AOCCTGGCTA CCAAGGGGAA GGCAGGCAGT GTGAGOACAT CGATGAATGT GGAAATGAGC 360 
TCAATGGAGG CTGTGTCCAT GACTGTTTGA ATATTCCAGG CAATTATCGT TGCACTTGTT 420 
TTGATGGCTT CATGTTGGCT CATGACGGTC ATAATTGTCT TGATGTGGAC GAGTQCCTGG 480 
AGAACAATGG OGGCTGCCAG CATACCTGTG TCAACGTCAT GGGGAGCTAT GAGTGCTGCT 540 
GCAAGGAGGG GTTTTTOCTO AGTGACAATC AGCACACCTG CATTCACCGC T0GGAAGAGG 600 
GCCTGAGCTG CATGAATAAG GATCACGGCT GTAGTCACAT CTGCAAGGAG GCCCCAAGGG 660 
GCAGCGTCGC CTGTGAGTGC AGGCCTGGTT TTGAGCTGGC CAAGAACCAG AGAGACTGCA 720 
TCTTGACCTG TAACCATGGG AACGGTGGGT GCCAGCACTC CTGTOACGAT ACAGCCGATG 780 
GCCCAGAGTG CAGCTGCCAT CCACAGTACA AGATGCACAC AGATGGGAGG AGCTGCCTTG 840 
AGCGAGAGGA CACTGTCCTG GAGGTGACAG AGAGCAACAC CACATCAGTQ GTGGATGGGG 900 
ATAAACGGGT GAAACGGCGG CTGCTCATGG AAAOGTGTGC TGTCAACAAT GGAGGCTGTG 960 
AG0GCACCTG TAAGG ATACT TOGACAGGTG TCCACTGCAG TTGTOCTGTT GGATTCACTC 1020 
TCCAGTTGG A TGGGAAGACA TGTAAAGATA TTG ATGAGTG CCAGACCCGC AATGGAGGTT 1080 
GTGATCATTT CTGCAAAAAC ATCGTGGGCA GTTTTGACTG CGGCTGCAAG AAAGGATTTA 1140 
AATTATTAAC AGATGAGAAG TCTTGCCAAG ATGTGGATGA GTGCTCTTTG GATAGGAOCT 1200 
GTGACCACAG CTGCATCAAC CACCCTGGCA CATTTGCTTG TGCTTGCAAC CGAGGGTACA 1260 
COCTGTATGG CTTCACCCAC TGTGGAGACA CCAATGAGTG CAGCATCAAC AACGGAGGCT 1320 
OTCAGCAGGT CTGTGTGAAC ACAGTGGGCA GCTATGAATG CCAGTGCCAC CCTGGGTACA 1380 
AGCTCCACTG GAATAAAAAA GACTGTGTGG AAGTGAAGGG GCTCCTGCCC ACAAGTGTGT 1440 
CACCCCGTGT GTCCCTGCAC TGCOGTAAOA OTGGTGGAGG AGAOGGGTGC TTCCTCAGAT !500 
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GTCACTCTGG CATTCACCTC TCTTCAGATG TCACCACCAT CAGGACAAGT GTAACCTTTA 1560 
AGCTAAATGA AGGCAAGTGT AGTTTGAAAA ATGCTGAGCT GTTTCCCGAG GGTCTGCGAC 1620 
GAGCACTACC AG AG AAGCAC AGCTCAGTAA AAGAG AGCTT COGCTACGTA AACCTTACAT 1680 
GCAGCTCTGG CAAGCAAGTC CCAGGAGCCC CTOGCCGACC AAGCACCCCT AAGGAAATGT 1740 
TTATCACTGT TG AGTTTGAG CTTGAAACTA ACCAAAAGGA GGTGACAGCT TCTTGTGACC 1800 
TG AGCTGCAT OGTAAAGOGA ACCGAGAAGC GGCTCCGT AA AOOCATOOGC ACGCTCAGAA I860 
AGGCCGTCCA CAGGGAGCAG TTTCACCTCC AGCTCTCAGG CATGAACXTTC GACGTGGCTA 1920 
AAAAGCCTCC CAGAACATCT GAACGCCAGG CAGAGTCCTG TGGAGTGGGC CAGGGTCATG 1980 
CAGAAAACCA ATGTGTCAGT TGCAGGGCTG GGACCTATTA TGATGGAGCA CGAGAACGCT 2040 
GCATTTTATG TCCAAATGGA ACCTTCCAAA ATGAGGAAGG ACAAATGACT TGTGAACCAT 2100 
GCCCAAOACC AGGAAATTCT GGGGCCCTGA AGACCCCAGA AGCTTOGAAT ATGTCTGAAT 2160 
GTGGAGGTCT GTGTCAACCT GGTGAATATT CTGCAGATGG CTTTGCACCT TGCCAOCTCT 2220 
GTGCCCTGGG CACGTTCCAG CCTGAAGCTG GTCGAACTTC CTGCTTCCCC TGTGGAGGAG 2280 
GCCTTGCCAC CAAACATCAG GGAGCTACTT OCTTTCAGGA CTOTGAAACC AGAGTTCAAT 2340 
GTTCACCTGO ACATTTCTAC AACAOCACCA CTCACCGATG TATTCGTTGC CCAGTOGGAA 2400 
CATACCAGCC TGAATTTGGA AAAAATAATT GTGTTTCTTG CCCAGGAAAT ACTACGACTG 2460 
ACTTTGATGG CltXACAAAC ATAACCCAGT GTAAAAACAG AAGATGTGGA GGGGAGCTGG 2520 
GAGATTTCAC TGGGTACATT GAATCCCCAA ACTACCCAGG CAATTACCCA GCCAACACCG 2580 
AGTGTACGTG GACCATCAAC CCACCCCCCA AGOGCOGCAT CCTGATCGTG GTCOCTGAGA 2640 
TCITCCTGCC CATAGAGGAC GACTGTGGGG ACTATCTGGT GATGCGGAAA ACCTCTTCAT 2700 
CCAATTCTGT G ACAACATAT G AAACCTGCC AGACCTACGA AOGOOCCATC GCCTTCACCT 2760 
CCAGGTCAAA GAAGCTGTGG ATTCAGTTCA AGTCCAATGA AGGGAACAGC GCTAGAGGGT 2820 
TCCAGGTCCC ATACGTGACA TATGATGAGG ACTACCAGGA ACTCATTGAA GACATAGTTC 2880 
GAGATGGCAG GCTCTATOCA TCTGAG AACC ATCAGGAAAT ACTTAAGGAT AAGAAACTTA 2940 
TCAAGGCTCT GTTTOATGTC CTGGOCCATC COCAGAACTA TTTCAAGTAC ACAGCCCAOG 3000 
AGTCCCGAGA GATGTTTOCA AGATCGTTCA TCCGATTGCT ACGTTCCAAA GTGTCCAGGT 3060 
TTTTGAGACC TTACAAATGA CTCAGCCCAC GTGCCACTCA ATACAAATGT TCTGCTATAG 3120 
GGTTGGTGGG ACAGAGCTGT CTrCCITCTG CATGTCAGCA CAGTCGGGTA TTGCTGCCTC 3180 
CCGTATCAGT GACTCATTAG AGTTCAATTT TTATAGATAA TACAGATATT TTGGTAAATT 3240 
GAACTTGGTT TTTCTTTCOC AGCATCGTGG ATGTAGACTG AGAATGGCTT TG AGTGGCAT 3300 
CAGCTTCTCA CTGCTGTGGG CGGATGTCTT GGATAG ATCA CGGGCTGGCT GAGCTGG ACT 3360 
TTGGTCAGCC TAGGTG AG AC TCACCTGTCC TTCTGGGGTC TTACTCCTCC TCAAGGAGTC 3420 
TGTAGTGGAA AGGAGGCCAC AGAATAAGCT GCTTATTCTG AAACTTCAGC TTCCTCTAGC 3480 
CCGGOCCTCT CTAAGGGAGC CCTCTGCACT CGTGTGCAGG CTCTGACCAG GCAGAACAGG 3540 
CAAGAGGGGA GGGAAGGAGA CCCCTGCAGG CTCOCTOCAC CCACCTTG AG ACCTGGGAGG 3600 
ACTCAGTTTC TCCACAGCCT TCTCCAGCCT GTGTGATACA AGTTTQATCC CAGGAACTTG 3660 
AGTTCTAAGC AGTGCTCGTG AAAAAAAAAA GCAGAAAGAA TTAGAAATAA ATAAAAACTA 3720 
AGCACTTCTG G AGACAT 

Sen ID NO: 64 Protein sequence: 
Protein Accession ft NP066025.1 

1 11 21 31 41 51 
I I I I I I 

MGVAGRNRPG AA WAVLUXL LLPPLLLLAG AVPPGRGRAA GPQEDVDECA QGLDDCHADA 60 
U3QNTPTSYK CSCKPGYQGB GRQCEDIDEC GNKLNGGCVH DCLNIPGNYR CTCFDGFMLA 120 
HDGHNCLDVD ECLENNGGCQ HTCVNVMGSY ECCCKEGFFL SDNQHTOHR SEEGLSCMNK 180 
DHGCSHICKE APRGSVACEC RPGFELAKNQ RDCILTCNHG NGGCQHSCDD TADGPECSCH 240 
PQYKMHTDGR SCLEREDTVL BVTESNTTSV VDGDKRVKRR LLMETCA VNN GGCDRTCKDT 300 
SIGVHCSCPV GFTLQLDGKT CKDIDECQTR NGGCDHFCKN IVGSFDCGCK KGFKLLTDEK 360 
SCQDVDECSL DRTCDHSON HPGTFACACN RGYTLYGFTH CGDTNECSIN NGGCQQVCVN 420 
TVGSYECQCH PGYKLHWNKK DCVEVKGLLP TSVSPRVSUH CGKSGGGDGC FLRCHSGIHL 480 
SSDVTTIRTS VTFKLNEGKC SLKNAELFPE GLRPALPEKH SSVKESFRYV NLTCSSGKQV 540 
PGAPGRPSTP KEMFTTVEFB LETNQKEVTA SCDl^OVKRTEKMJUCAm TLRKAVHREQ 600 
FHLQLSGMNL DVAKKPPRTS ERQAESCGVG QGHAENQCVS CRAGTYYDGA REROLCPNG 660 
TFQNEEGQMT CEPCfRPGNS QALKTPEA WN MSEOGGLCQP GEYSAPGFAP CQLCALGTFQ 720 
PEAGRTSCFP CGGGLATKHQ GATSFQDCET RVOCSPGHFy NTTTHRORC PVGTYOPEFG 780 
KNNCVSCPGN TTTDFDGS1T4 ITQCKHRRCG GELGDFTGYI ESPNYPGNYP ANTECTWTIN 840 
PPPKRRILIV VPEEFLPIED DCGDYLVMRK TSSSNSVTTY ETCQTYBRPI AFTSRSKKLW 900 
IQFKSNEGNS ARGFQVPYVT YDEDYOHLIE EWVRDGRLY A SENHQEHJCD KKLDCALPDV 960 
LAHPQNYFKY TAQESREMFP RSFIRLLRSX VSRFLRPYK 

Seq ID NO: 65 DNA sequence 

Noclefe Acid Accession ft NM_007210 

Coding seonence: 1-1869 (underlined scqoences con^^ 

1 11 21 31 41 51 
I I I I I I 

ATG AGGCTCC TCCGCAG ACQ CCACATGCCC CTGCGCCTGG CCATGGTGGG CTOCGCCTTT 60 
GTOCTCTTCC TCTTCCTCCT GCATAGGGAT GTGAGCAGCA GAGAGGAGGC CACAGAGAAG 120 
CCGTGGCTGA AGTCCCTGGT GAGCCGG AAG GATCACGTCC TGG ACCTCAT GCTGGAGGCC 180 
ATGAACAACC TTAG AGATTC AATGCCCAAG CTCCAAATCA GGGCTCCAG A AGCCCAGCAG 240 
ACTCTGTTCT CCATAAACCA GTCCTGCCTC CCTOGGTTCT ATACCCCAGC TGAACTGAAG 300 
CCCTTCTGGG AACGGCCACC ACAGGACCCC AATGCCCCTG GGGCAGATGG AAAAGCATTT 360 
CAGAAGAGCA AGTGGACCCC CCTGGAGACC CAGGAAAAOG AAGAAGGCTA TAAGAAOCAC 420 
TGTTTCAATG CCTTTGCCAG CGACCGGATC TCCCTGCAGA GGTCCCTGGG GCCAGACACC 480 
CGACCACCTG AGTGTGTGG A CCAGAAGTTC CGGCGCTCCC CCCCACTGGC CACCACCAGC 540 
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GTGATCATTG TGTTCCACAA COAAGCCTGG TCCACACTOC TGCGAACAGT GTACAOCOTC 600 
CTACACACCA CCCCTGCCAT CTTGCTCAAG GAGATCATAC TGGTGGATGA TGCCAGCACA 660 
GAGGAGCAGC TAAAGGAGAA GCTGGAGCAG TACGTGAAGC AGCTGCAGGT GGTGAGGGTG 720 
GTGCGGCAGG AGGAGCGGAA GGGGTTGATC ACCGCCCGGC TGCTGGGGGC CAGCGTGGCA 780 
5 CAGGCGGAGG TGCTCACGTT QCTGGATGCC CACTGTGAGT GCTTCCACGG CTGGCTGGAG 840 
CCCCTCCTGG CTCGAATCGC TGAGGACAAG ACAGTGGTGG TG AGCCCAGA CATCGTCACC 900 
ATCGACCTTA ATACTTTTGA GTTCGCCAAG CCCGTCCAG A GGGGCAGAGT CCATAGCCG A 960 
GGCAACTTTG ACTGG AGCCT GACCTTCGGC TGGGAAACAC TTCCTCCACA TGAG AAGCAG 1020 
AGGCGCAAGG ATGAAACATA CCCCATCAAA TCCCCGACGT TTGCTGGTGG CCTCTTCTCC 1080 

10 ATCCCCAAGT CCTACTTTGA GCACATOGGT ACCTATGATA ATCAGATGGA GATCTGGGGA 1140 
GGGGAGAACO TGGAAATGTC CTTCCGGGTG TGGCAGTGTO GGGGCCAGCT GGAGATCATC 1200 
CCCTCCTCTG TCGTAGGCCA TGTOTrCCGG ACCAAGAGCC CCCACACCTT CCCCAAGGGC 1260 
ACTAGTGTCA TTGCTCGCAA TCAAGTGCGC CTGGCAGAGG TCTGGATGGA CAGCTACAAG 1320 
AAGATTTTCT ATAGGAGAAA TCTGCAGGCA GCAAAG ATGG CXXAAGAGAA ATCCTTCGGT 1380 

15 GACATTTCGG AACGACTGCA GCTGAGGGAA CAACTGCACT GTCACAACTT TTCCTGGTAC 1440 
CTGCACAATG TCTA(XCAGA GATGTTTGTT CCTGACCTGA CGCCCACCTT CTATGGTGCC 1500 
ATCAAGAACC TCGGCACCAA OCAATGCCTG GATGTGGGTG AGAACAACCG CGGGGGGAAG 1560 
CCCCTCATCA TGTACTCCTG CCACGGCCTT GGCGGCAACC AGTACTTTGA GTACACAACT 1620 
CAGAGGGACC TTCGCCACAA CATCGCAAAG CAGCTGTGTC TACATGTCAG CAAGGGTGCT 1680 

20 CTGGGCCTTG GGAGCTGTCA CITCACTGGC AAG AATAGOC AGGTCCCCAA GG ACQ AGGAA 1740 
TGGGAATTGG (XCAGGATCA GCTCATCAGG AACICAGGAT CTGGTACCTG CCTGACATCC 1800 
CAGGACAAAA AGCCAGCCAT GGCCCCCTGC AATCCCAGTG ACCCCCATCA GTTGTGGCTC 1860 
TTTGTCTAGG AOCCAGATCA TCCOCAGAGA GAGCCOCCAC AAQCTCCTCA GOA AACAGG A 1920 
TTGCTGATGT CTGGGAACCT GATCACCAGC TTCTCTCGAG GCCGTAAAGA TGGATT TCTA 1980 

25 AACCCACTGO OTOGCAAGGC AGGACCTTCC TAATOCTTQC AACAACATTO GGCCCATTTT 2040 
. CI I J U;riC A CACCGATGGA AGAGACCATT AGGACATATA TTTAGOC TAG CUl 1 11 CCTG 2100 
TTCTAGAAAT AGAGGCTCCC AAAGTAGGGA AGGCAGCTGG GGGAGGGTTC AGGGCAGCAA 2160 
TGCTGAGTTC AAGAAAAGTA CTTCAGGCTG GGCACAGTGG CTCATGCCTG AAATCCTAGC 2220 
ACTTTGGGAA GACAATGTGG GAGAATGGCT TGAGOOCAGO AGTTCAAQAC CGGCCTGAGC 2280 

30 AACATAGTGA GGATCCCATC TCTACGCCCA CCCTCCCCCC GGCAAAAAAA AAAGCTOGGT 2340 
ATGGTGGCTT ATGCCTGTAG TCGCAGCTAC TCAGAAGGCT GAGGTGGGAG GATTGCTTGT 2400 
TCCCCGGAGG TTGAAGCTAC AGTGAGCCTT GATTGTGTCA CTOCACTCCA GCCTGGGCAA 2460 
CAGGTAAGAC TCTGTCTCAA AAAAAAAACA AAAAAGAAGA AG AAAAGTAC TTCTACAGCC 2520 
ATGTOCTATT OCTTGATCAT CCAAAGCAOC TGCAGAGTDC AGTGAAATG A TATATTCTGG 2580 

35 CTGGGCACAG TGGCTCACAC CTGTAATCCT AGCACTTTGG GAGGCCAAGG CAGGTGGATC 2640 
ACCTGAGGTC AGAAGTTTGA AACCAGCCTG GACTACATGG TGAAACTCCA TCTCTACTAA 2700 
AAGTACAAAA ATTAGCTGGG CATGATGGCA CGCACCTGCA GTOCCAGCTA CTTGGGAGGC 2760 
TGAGGCAGGA GAATCACTCG AAOCCAGGAG GCAGAGGTTG CAGTGAGCCA AGACAGCACC 2820 
ATTGCACCCC AGCCTGAGCA ACAAGAGCGA AACTCCATCT CAGGAAAAAA AAAAAAAAAA 2880 

40 a 



Sea ID NO: 66 Protein 



Protctn Accession #: NP_009141 
45 1 U 21 31 41 51 

MRLLRRRlftiP LRLAM VGCAF VLFLFLLHRD VSSREEATEK PWLKSLVSRK DHVLDLMLEA 60 
MNKLRDSMPK LQIRAPEAQQ TLFSINQSCL PGFYTPAELK PFWERPPQDP NAPGADGKAF 120 
QKSKWTPLET OEKEEGYKKH CFNAFASDRI SLQRSLGPDT RPPECVDQKF RRCPPLATTS 180 

50 VIIVFHNEAW STLLRTVYSV LHTTPAHUC EULVDDAST EEHLKEKLEQ YVKQLQVVRV 240 
VRQEERKGLI TARLLGASV A QABVLTFLDA HCECFHGWLE PIXAMAEDK TVVVSPDTVT 300 
IDLNTFEFAK PVQRGRVHSR GNFPWSLTFG WETLPPHEKQ RRKPETYHK SFTFAGGLFS 360 
1PKSYFBH1G TYBNQMHWG GENVEMSFRV WQCGGQLHn PCSWGHVFR TKSPHTFPKX} 420 
TSV1ARNQVR LABVWMDSYK KDrYRRNLQA AKMAQEKSFG WSHRLQLRB QLHCHNFSWY 480 

55 IHNVYPEMPV PDLTPTFYGA BCNLGTNQCL DVGENNRGGK PUMYSCHGL GGNQYFEYTT 540 
QRDLRHNIAK QLCLHVSKGA LGLGSCHFTG KNSQVPKDEB WELAQDQLIR NSGSGTCLTS 600 
QDKKPAMAPC NPSDPHQLWL FV 



^ Scq ID NO: 67 DMA s_- , 

60 Kudeic Add Accesskm fc NMJH41 12 

Consequence: 600-4484 (traded^ 

1 11 21 31 41 51 

65 mCTCCGCG AAGGCTOCTTTGATATTAAT AGTGTTGGTG TCTTGAAACT GACGTAATGC 60 
GCGGAG ACTG AGGTCCTGAC AAGCGATAAC ATTTCTOATA AAGACCCGAT CTTACTGCAA 120 
TCTCTAGCGT CCT LTl l i l l GGTGCTGCTG GTTTCTCCAG ACCIU3CGTC CTCTCGATTG 180 

cicitnaKx:Ticcr Amt:Tiuuiiii 1 1 1 iaaacaa aaaacaacaccccctcccct 240 

CTCCCACCCG GCACCGGGCA CATOCrTGCX CTA lTiLXjl 1 1 H1C 'H'lCT CICTC 300 
70 TCTCmTTT AATAAGGGTG GGGGAGGGAA AGGGGGGGG A GGCAGGAAAG ACCTTTTTCT 360 
CTCCCCCCCG CAATAATCCA AGATCAACTC TGCAAACAAC AGAAGACGGT TCATGGCTTT 420 
GGCCGCCGCG CCACCATCTT TCGGGCTGCC GAGGGTGTTC TTGACGATTA ATCAACAGAT 480 
GTACAGATCA GCTCTCAAAA TUTICA' 1CTQT WITH ' IO AO Uf 1C1 ICTAA GACAATTGCA 540 
TTAGCCTCCT GCTAGTTGAC TAATAGAATT AATAATTGTA AAAAGCACTC TAAAGCCACA 600 
75 TGCCTT ATG A AGTCAATGCT GGGTATGATT TTACAAATAT GGTCOGGAAA AAGAACCCCC 660 
CTCTGAG AAA CGTTGCAAGT GAAGGCGAGG GCCAGATCCT GOAGCCTATA GGTACAGAAA 720 
GCAAGGTATC TGGAAAGAAC AAAGAATTCT CTGCAGATCA GATGTCAGAA AATACGG ATC 780 
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AGAGTG ATGC TGCAG AACTA AATCATAAGG AGGAACATAG CTTGCATGTT CAAGATCCAT 840 
CTTCTAGCAG TAAGAAGGAC TTGAAAAGCG CAGTTCTGAG TGAGAAGGCT GGCTTCAATT 900 
ATGAAAGCCC CAGTAAGGGA GGAAACTTTC CCTtXTTTCC GCATQATGAG GTGACAGACA 960 
GAAATATOTT GGCTTTCTCA TTTCCAGCTG CTGGGGGAGT CTQTGAGCOC TTG AAGTCTC 1020 
CGCAAAGAGC AGAGGCAGAT GACCCTCAAG ATATGGCCTG CACCCCCTCA GGGGACTCAC 1080 
TGGAGACAAA GGAAGATCAG AAGATGTCAC CAAAGGCTAC AGAGGAAACA GGGCAAGCAC 1140 
AGAGTGGTCA AGCCAATTGT CAAGGTTTGA GCCCAGTTTC AGTGGCCTCA AAAAACCCAC 1200 
AAGTCCCTTC AG ATGGGGGT GTAAGACTGA ATAAATCCAA AACTGACTTA CTGGTGAATG 1260 
ACAACCCAGA CCCGGCACCT CTGTCTCCAG AGCTTCAGGA CTTTAAATGC AATATCTGTG 1320 
GATATGGTTA CTACGGCAAC GACCOCACAG ATCTGATTAA GCACTTCCGA AAGTAT CACT 1380 
TAGGACTGCA TAACCGCACC AGGCAAGATG CTGAGCTOGA CAGCAAAATC TTGGCCCTTC 1440 
ATAACATGGT GCAGTTCAGC CATTCCAAAG ACTTCCAGAA GGTCAACCGT TCTGTGI 111 1500 
CTGGTGTGCT GCAGGACATC AATTCTTCAA GGCCTGTTTT ACTAAATGGG ACCTATGATO 1560 
TGCAGGTGAC TTCAGGTGGA ACATTCATTG GCATTGGACG GAAAACACCA GATTGCCAAG 1620 
GG AACACCAA GTATTTCCGC TGTAAATTCT GCAATTTCAC TTATATGGGC AACTCATCCA 1680 
CCGAATT AGA ACAACATTTT CTTCAGACTC ACCCAAACAA AATAAAAGCT TCTCTCCCCT 1740 
CCTCTGAGGT TGCAAAACCT TCAGAGAAAA ACTCT AACAA GTCCATCCCT GCACTTCAAT 1800 
CCAGTGATTC TGGAGACTTG GGAAAATOOC AGOACAAG AT AACAGTCAAA GCAGGAGATG 1860 
ACACTCCTGT TGGGTACTCA GTGCCCATAA AGCCOCTCGA TTOCTCTAGA CAAAATGGTA 1920 
CAGAGGCCAC CAGTTACTAC TGGTGTAAAT TTTGTAGTTT CAGCTGTGAG TCATCTAGCT 1980 
CACITAAACT GCTAGAACAT TATGGCAAGC AGCACGGAGC AGTGCAGTCA GGCGGCCTTA 2040 
ATCCAGAOTT AAATGATAAG CTTTCCAGGG GCTCTGTCAT TAATCAO AAT GATCTAGCCA 2100 
AAAGTTCAGA AGGAGAGACA ATGACCAAGA CAGACAAGAG CTCGAGTGGG GCTAAAAAGA 2160 
AGGACTTCTC CAGCAAGGGA GCCGAGGATA ATATGGTAAC GAGCTATAAT TGTCAGTTCT 2220 
GTGACTTCCG ATATTCCAAA AGCCATGGCC CTGATGTAAT TGTAGTGGGG CCACTTCTCC 2280 
GTCATTATCA ACAGCTCCAT AACATTCAGA AOTOTACCAT TAAACACTGT CCATTCTGTC 2340 
CCAGAGGACT TTGCAGCCCA GAAAAGCACC TTGGAGAAAT TACTTATCCG TTTGCTTGTA 2400 
G AAAAAGTAA TTGTTCCCAC TGTGCACTCT TGCTTCTGCA CTTGTCTCCT GGGGCGGCTG 2460 
GAAGCTCGCO AGTCAAACAT CAGTGCCATC AGTGTTCATT CAOCACOCCT GACGTAGATG 2520 
TACTCCTCTT TCACTATG AA AGTGTGCATG AGTCCCAAGC ATCGGATGTC AAACAAGAAG 2580 
CAAATCACCT GCAAGGATOG GATGGGCAGC AGTCTGTCAA GGAAAGCAAA GAACACTCAT 2640 
GTAOCAAATG TGATTTTATT ACCCAAGTOG AAGAAG AG AT TTCCCGACAC TACAGGAGAG 2700 
CACACAGCTG CTACAAATGC CGTCAGTGCA GTTTTACAGC TGCCGATACT CAGTCACTAC 2760 
TGGAGCACTT CAACACTGTT CACTGCCAGG AACAGGACAT CACTACAGCC AACGGCG AAG 2820 
AGGACGGTCA TGCCATATCC ACCATCAAAG AGGAGCCCAA AATTOACTTC AGGGTCTACA 2880 
ATCTGCTAAC TCCAGACTCT AAAATGGGAG AGCCAGTTTC TGAGAGTGTG GTGAAGAGAG 2940 
AGAAGCTGG A AG AGAAGGAC GGGCTCAAAG AGAAAGTTTG GAOCGAGAGT TCCAGTGATG 3000 
ACCTTCGCAA TGTGACTTOG AOAGGGGCAG ACATCCTOCO GGGGAGTCCG TCATACACCC 3060 
AAGCAAGOCT GGGGCTGCTG AOGCCTGTGT CTGGCACCCA AGAGCAGACA AAGACTCTAA 3 120 
GGGATAGTCC CAATGTGGAG GCCGCCCATC TGGCGCGACC TATTTATGGC TTGGCTGTGG 3180 
AAACCAAGGG ATTCCTGCAG GGGGCGCCAG CTGGCGGAGA GAAQTCTGGG GCCCTCCCCC 3240 
AGCAGTATCC TGCATCGGGA GAAAACAAGT OCAAGGATGA ATCCCAGTCC CTGTTAOGGA 3300 
GGCGTAGAGG CTCCGGTGTT TTTTGTGCCA ATTOCCTGAC CACAAAGACC TCTCTCTGGC 3360 
GAAAGAATGC AAATGGCGGA TATGTATGCA ACGCGTOTGG CCTCTAOCAG AAGCTTCACT 3420 
CGACTCCCAG GCCTTTAAAC ATCATTAAAC AAAACAACGG TGAGCAGATT ATTAGGAGGA 3480 
GAACAAGAAA GCGCCTTAAC CCAG AGOCAC TTCAGGCTGA GCAGCTCAAC AAACAGCAGA 3540 
GGGGCAGCAA TGAGGAGCAA GTCAATGGAA GCCCGTTAGA GAGGAGGTCA GAAGATCATC 3600 
TAACTGAAAG TCACCAGAGA G AAATTCCAC TOCCCAGCCT AAGTAAATAC GAAGCCCAGG 3660 
GTTCATTG AC TAAAAGCCAT TCTGCTCAGC AGCCAGTCCT GGTCAGCCAA ACTCTGGATA 3720 
TTCACAAAAG GATGCAACCT TTGCACATTC AGATAAAAAG TCCTCAGGAA AGTACTGG AG 3780 
ATCCAGGAAA T AGTTCATCC GTATCTG AAG GGAAAGGAAG TTCTGAGAGA GGCAGTCCTA 3840 
TAGAAAAGTA CATGAGACCT GCGAAACACC CAAATTATTC ACCACCAGGC AGCOCTATTG 3900 
AAAAGTACCA GTACCCACTT TTTGGACTTC CCTTTGTACA TAATGACTTC CAGAGTGAAG 3960 
CTGATTGGCT GCGGTTCTGG AGTAAATATA AGCTCTCCGT TCCTGGGAAT CCGCACTACT 4020 
TG AGTCACGT GCCTGGCCTA OCAAATOCTT GGCAAAACTA TGTGCCTTAT (XCACCTTCA 4080 
ATCTGCCTCC TC A1 H MC A GCTGTTGGAT CAGACAATG A CATTCCTCTA GATTTGGCGA 4140 
TCAAGCATTC CAGACCTGGG OCAACTOCAA ACGGTGCCTC CAAGGAGAAA ACGAAGGCAC 4200 
CACCAAATGT AAAAAATGAA GGTCCCTTGA ATGTAGTAAA AACAGAGAAA GTTGATAGAA 4260 
GTACTCAAGA TGAACTTTCA ACAAAATGTG TGCACTGTGG CATTGTCTTT CTGGATGAAG 4320 
TGATGTATGC TTTGCATATG AGTTGCCATG GTGACAGTGG ACCTTTCCAG TGCAGCATAT 4380 
GCCAGCATCT TTGCACGGAC AAATATGACT TCACAACACA TATCCAG AGG GGCCTGCATA 4440 
GGAACAATGC ACAAGTGGAA AAAAATGGAA AACCTAAAGA GTAAAACCTT AGCACTTAGC 4500 
ACAATTAAAT AGAAATAGGT TTTCTTQATG GGAATTCAAT AGCTTGTAAT GTCTTATGAA 4560 
0 ACCTATTAA AAAAATACTT CATAGAGCCT GCCTTATCCA ACATGAAATT CCCTTCTTTT 4620 
GTTATTCTTT CTTTTGATGA GTAGGTTACC AAGATTAAAA AGTGAGATAA ATGGTCAATG 4680 
AGAAAGAATG GAAGATGGTA AACAATCACT TTTTAAAACC TGTTAAGTCA AAACCATCTT 4740 
GGCTAATATG TACTGGGGAA ATAATCCATA AGAGATATCA CCAGACTAGA ATTAATATAT 4800 
TTATAAAGAA AGAGACCAAA ACTGTCTAOA ATTTGAAAGG GTTTACATAT TATTATACTA 4860 
AAGCAGTACT GOACTOGCCA TTGGAOCATT TGTTCCAAAA CCCATAAATT GTTGCCTAAA 4920 
TTTATAATOA TCATGAAACC CTAGGCAGAG GAGGAQAAAT TGAACGTCCA GGGCAATGAA 4980 
AGAAAAATGG CGCCCTCTCA ATTTAGTCTT CTCTCATTGG (XATCTTTCA GATTTTGACC 5040 
TAGAAATGCG AGCTGTGGTT AGGCTTGGTT AGAGTGCAGC AAGCAACATG ACAGATQGTO 5100 * 
GCACG CRi 1 1 1 1 lACCCAGC CCTGCCTOTA C ATACACATG CACACCCTCT CTGATATTTT 5160 
TGTCCTTTAG ATGTTCAAAT ACTCAGTAGT CCTTTTGTTT GCGGTTTAGA TTCATTTTGT 5220 
CCACACATGT ACCCATTTTA AAAAACAATG TCCTCGATGC TTCTOTAGTO ATTTCATTTT 5280 
AGCCAGGTAT TTCTTTCTTG TGTGTGATG A ACCAGTATGG ATTTGCTTTT CTAAGCCTCC 5340 
TOTTGGTTAC TAATCTCACT TGGCACATTA TAACTAAAGG AATCCCCTCA ATTCAAAAGC 5400 
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ATAGATGGAT ACAAATGTCA GACCGTGCGT TTAATTTOTT TAGAACACAT OOCATTTCTT 5460 
CACAAGGTAA CCTGCTGTAT TTATTTATTT TCTTTTGGTT AAATATAATT TCCAAACTTT 5520 
GTGGTCAGGC AGCGTCTAAG GTTAOGTTAC CACAGACTG A CAGTTGGTAT ATGTA(XAGC 5580 
CAATCCCTTC ATTAAATGTA TACAGATTTA GTTAAGTAGC ATTAAATAGG ATTCTTAGAA 5640 
5 GTATGTCCTC ATAGAACTTT TAATACTTAA GGCTTTGTAA AAACTA^TGAAGGGAAA 5700 
OCTCCTCAGC ATAACTOCTC AGGGAAATAG GGCTAAATAA CTGAACATTA AATAATTGGT 5760 
TAAAGOTGCT GTTAGTCGAG CCTCAATGCT TGCTACAAGG ATGTATGTAC AAGGACTGAC 5820 
TTTAATAATT TGCATTATAT TGTCCCAACC AGTAGTTTAT TTTTTGCCAC GGAGATGTAO 5880 
AAGATATTAC AAGCTACTGG ATGCACTGTC AGATTAACTT ATTTCATTAA AGAAGTTGGG 5940 

10 AG AACAAATA GG AAAAAAAA AACTTATTTT TCTAGTAAAT ATTAATGTAT TACATTTCAA 6000 
ATAATGGTGC CTG ACATATT GAATAATTAT TTTCTACAGT GTACGTATGC AACAAAGATA 6060 
TTCCATCATG CATTAGAGTC AGTTCTGGCT CTGOCTAGCT GTTTACATTT OCAAATGTAG 6120 
CAAACAAGGT AATGAAGCAA CTATTTCTAT TGCAGTAG AT ATCCTTTTGT GTGTGTGTGT 6180 
GTGCATTAAA GTTGTAAAOG GTAACATGAA ACAAATGAAA GTTCTTGCTA TAATGGTATG 6240 

1 5 GAAAACAAGA AGGAAATGAA AATATTTTTA TGCCTACTTA GGAAAAAAAG GGTAGCACTT 63C0 
ATTCATTCCA AGTACTTTTT TTTTTTTAAT TTTTAAGCTC TT AACTCA CA TTGTTATGCT 6360 
TAAG ATG ATA AACATATATC CTCTTTTTAT TOCTTTGTCT ATGT TTCATA T GAAACATTT 6420 
CAGAAATTAT TTTGATAAGT GTTGCTGGAA TCTGCAACGC ^^EJ^^^ESIP 
TAGTCGCATT TGCACTCCAT TTTTACATTA ATTCGCAGTT GCTTTOTATC ATTGTTTTGT 6540 

20 TTGGGTTTTG TnCIU UC ACAGTGCCGO GTCTTCGTTT CTTAAAGTTG GAT^CAOTT66C0 
AGAGTTCAAC CAGTTCGTGA CTGTTGTAGC GAATGAAGTT AAAAAAATGT CTTTCTGATG 6660 
TTGTGTTGTC ATTTTCATTT TTGCATTTTT TTGTTTGCAT ATTAAAAAAA G AGAAAAGAG 6720 
AAAGCAAGAG ACAGAAATCA GGACTAAGTC CTCTGCTTCA GTTTCATTGT TAACGGGCCT 6780 
TATTCTGATC TCACCTOTCO CX3TAGCTCTA ATATTCACAT AAACTGAAAT AA AGAAGT GG 6840 

25 AATGAGGAGC TTTGACATTC AAATTATGTG ATGTAATTTA TCTTCCTTAG <^y™ A * f** 

GGATGCATCT CAAAATGTAT AGCCAGACTT GAG AGGTGAC AATTAAAGAT CTAAAAAAG A 6960 
GAGGAGATTC CCCCAAACAA CAATATTTAA TTTTCTTAGT AAAAAGAATA ACAGAATGCA 7020 
TCGTGGCAAT CCTTAAGCAA CATTATCTAT GTGG ACTGCT TAAATCAGCA AAACACCAGA 7080 
AGTTTGGTTA ACTTGGGCAA TATGACAAGT ATTACH 111 GGGCAAAACT A CTCATTAA O 7140 

30 CAATTTCTCT AGTGTGTCGG ACACAAATAG GTTCTTTATT TTTGGCATGT ATGCCTTTTT 72CO 
ATTTTT!ATTC AA' 1 1 1 H II 1 11 1 lUCAOA CAGACATAGT AGTATCAACT AGCATTGGAA 7260 
AATACATATC ACTATTCTTG GAATATTTAT GGTCAGTCTA CTTTTTAGTA AAATATTTTT 7320 
GGATAGOGTT GACACG ATAG ATCTTATTCC ATACTTCTTT ATTATTGATA ATTTTATTTT 7380 
CATTTTTTGC TTTCATTATT ATACATATTT TGGTGG AGAA GAGGTTGGGC TTTTTTG AAA 7440 

3 5 GAGACAAAAA TTTATTATAA CACTAAACA C TCCT TTTTTO ACATATTAAA GCCmATTC 7500 
CATCTCTCAA O ATATATTAT AAAATTTATT TTTTTAATTT AAGATTTCT G ^*J[™^™L 
TCTTAAATTG TGATTTTAAA CGAGCTATTA TGGTACGGAA CTTTTTTTAA TGAGGAATTT 7620 
CATGATG ATT TAGGAATTTT CTCTCTTGGA AAAGGCTTCC CCTGTG ATGA AAATGATGTG 7680 
CCAGCTAAAA TTGTGTGCCA TTTAAAAACT GAAAATATTT TAAAATTATT TGTCTATATT 7740 

40 CTAAATTGAG CTTTGGATCA AACTTTAGGC CAGG ACCAGC TCATGCOTTC TCATTCTTCC 78CO 
TTTICICACr CI 1 Ill CI C A TCACICAOCT CTGTATTCAT TL'fU I'Ui I IT GGGATAGAAA 7860 
AATCATAAAG AGCCAACCCA TCTCAGAACG TTGTGGATTG AGAG AGAC AC TACATGACTC 7920 
CAAGTATATG AGAAAAGG AC AGAGCTCTAA TTGATAACTC TGTAGTTCAA AAGGAAAAGA 7980 
GTATGCCCAA TTCTCTCTAC ATGACATATT GAG ATTTTTT TTAATCAACT TTTAAGATAG 8040 

45 TGATGTTCTG TTCTAAACTO TTCTGTTTTA GTGAAGGTAG ATTTTT ATAA AACAAGCATG 8100 
GGGATTCTTT TCTAAGGTAA TATTAATGAG AAGGGAAAAA AGTATCTTTA ACAGCTCTTT 8160 
GTTGAAGCCT GTGGTAGCAC ATTATGTTTA TAATTGCACA TGTGCACATA ATCTATTATG 8220 
ATCCAATGCA AATACAGCTC CAAAAATATT AAATGTATAT ATATTTTAAA ATGCCTGAGG 8280 
AAATACATTT TTCTTAATAA ACTGAAGAGT CTCAGTATGG CTATTAAAAT AATTATTAGC 8340 

50 CIUJ f GTTGT GTGGCTGCAA AACATCACAA AGTGACCGGT CTTOAQACCT GTQAACTGCT 8400 
GCOCTGTTTA GTAAATAAAA TTAATGCATT TCTAGA^OGG GAATATCTGC CATOCACT^ 8460 
TGGAAATGTG GAGTAAAOAA GCTGGTGGTC TGCTTCTGTG CTGTATGCCA GCCTTTTGCC 8520 
TTAAGTTGAG AGGAGGTCAA CTTTAGCTAC TGTCTTTGGT TTGAGAGCCA TGOCAAAAAA 8580 
AAAAAAAG AA AAAAAGATCA AGTCGTCTTT GGTGAGCCAO TAAGGTGAAA GCTTGCTGAC 8640 

55 TGTCCAAGGC ACAAGAGAAA ATTGAGGAAT TOAAATGCAA CCTGAGTATC AAACTAAATA 8700 
TTCTAATCAA AGGTAGGTAC TGTTAGGTGG AATTCTATCA GCAG GCAACT CCAAATQ AOA 8760 
AGAAGATAGA AGGAOGCOCG TCGGGACTTT GGAGGGCATT OTTAT TTTCC CAAAGAAAGA 8820 
CGGCCAAGGG CAGAGGCATG GATTCTTTGC AGAGCACTTC CTTTTGOTTT TTCAGTACTG 8880 
TTTCATAOAC AGTGGGCTCA CATGTTOCTG ATAGTGCTGC AGTTGCTTAG AAAGCATCCC 8940 

60 AGTTAATTGC AGTAATTAGA ACTTCTGGAA TATGCTAGGG CAGAAGTATG TCAAGTATGT 9000 
CACATGAAGA AAATGTG AAA TTCAAGAGTA ATCCACACGT QAOAAACTAG ACAATGTACA 9060 
TTCATGTGTT CTCTTGAAAG GAAAGGGAGA GCTGTAAGCT TCACTCTGTC CTACACCGGA 9120 
flAAAAGCAGG AATAACTTTA CCGTGQAAAT AATGTTTAGC TTTTATCAG A GAAAATTGTC 9180 
CTTCTAGAGC ATAGAGTCCC AAAACTCAAT TCTGOTTTTC OCCTGTTTTT nmmn 9240 
65 TTTTTCCCAA CATATGAACT GCAGCATATC ACTTTTTCTT TTTGTGCCTC AGGTTCCTCA 9300 
CCTGTAAAAT TO AAAAATAT ATGTATTAAT AATATTATTA ATAATAATAA TGGTAATGTA 9360 
GTACTTGTTT GTAAAGCACT TTGAGATCCT TGGrrTGAAAO GCACCATAGG AGTGCCAAGT 9420 
ATTATTATGT GGCCAAGGGG GTTATTTAAA CTGTCAGTTC CCAAAGGCCA GGAAAG GTTQ 9480 
GGGTCATTTT TCTTAAAG AC G AGCTOTAAA TATCAACTAG GCAGCCAATA GTGTTGACTA 9540 
70 TGAAGATGCA AAACTATTAC TAGGCTGATA AAATCATAGT TTCTTAATOG CTACCAATAA 9600 
GCCAAATATC ACAATAATAA ACGCCAAATT OCTTAGGGCG GACTATTTGA CAACCACATG 9660 
GAAAACTTTG GGGGAGGCAT GAGGGGGGAA CATCTCAAAA TGCCAATGTA AAA TTTAACT 9720 
TACAGCAATA TTCACCAGCA GAAAATGTCT TTCATATGG A ATGATTTCAT GTTGCTAAOA 9780 
AAAAGAATTC AATTTGTAGT CCTGATTTGA ATACTAGAAT GTTGGCTATA AT AGTTC TGT 9840 
75 TCTTACAACA CATGAAATTT TTTCGTTTTA TTTTATTTTG TTTTCATAGT GCATGTTCAT 9900 
TTCTACTCAC AAACATGTTC TTGGTGTATT TCTTATGCAA ACAATCTTCA GGCAGCAAAG 9960 
ATGTCTGTTA CATCTAAACT TGAATAATAA AGTTTTACCA CCAGTTACAC A 
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Seq ID NO: 68 Protein 



Proteb Accession fc NPJD5483 1 

1 11 21 31 41 . 51 
I I I I I 



MPYEVNAGYD FTMMVRKKNP PLRNVASEGE GQILEPIGTE SKVSGKNKEF SADQMSHJTD 60 
QSDAAELNHK EEHSLHVQDP SSSSKKDLKS AVLSEKAGFN YBSPSKGGNF ^^^^^P 
RNMLAFSFPA AGGVCEPLKS PQRAEADDPQ DMACTPSGDS LBTKEDQKMS PKATEBTGQA 180 
10 OSGOANCQQL SPVSVASKNP QVPSDGOVRL NKSKTDLLVN DNPDPAPLSP HLQDFKChaC 240 
GYG^GNDFT DLDCHFRKYH LGLHNRTRQD AELDSKDLAL HNMVQFSHSK DFQKVKRSVF 300 
SGVLQDINSS RPVLLNGTYD VQVTSGGTFI G1GRKTPDCQ GNTKYFRCKF CNFTYMGNSS 360 
TELEOHFLQT HPNKDCASLP SSEV AKPSEK NSNKSIPALQ SSDSGDLGKW QDKTTVKAGD 420 
DTPVGYSVPt KPLDSSRQNG TEATSYYWCK FCSFS^SSS SLKLLEHYGK QHGAVQSGGL 480 
15 NPELNDJOSR GSVINQNDLA KSSEGETMTK TDKSSSGAKK KDPSSKGAED NMVTSYNCQF 540 
CDFRYSXSHG PDVIWGPLL RHYQQLHNIH KCTOCHCPFC PRGtCSPEKH LGKTTYPFAC 600 
RKSNCSHCAL UXHLSPG AA GSSRVKHQCH QCSFTTPDVD VLLFHYESVH BSQASDVKQE 660 
ANHUXjSDGQOSVKBSKEHSCIK^ 720 
I^ffNTVHO? EQDITTANGE EDGHABTIK EEPKIDFRVY NLLTPDSKMG EPVSBSWKR 780 
20 EKLEEKDGLK EKVWTESSSD DLRNVTWRGA DILRGSPSYT QASLGLLTPV SGTQEQTKTL 840 
RDSPNVEAAH LARPIYGLAV ETKGFLQGAP AGGEXSGALP QQYPASGENK SKDESQSLLR 900 
RRRGSGVFCA NCLTTICTSLW RKNANGGYVC NACGLYQKLH CTPRPLNHK QNNGBQIIRR 960 
RTRKRLNPEA LQAEQLNKQQ RGSNEEQVNG SPLERRSEDH LTESHQREIP LPSLSKYBAQ 1020 
GSLTKSHSAQ QPVLVSQTLD WKRMQPLHI QDCSPQESTG DPGNSSSVSE GKGSSERGSP 1080 
25 HXYMRPAKH PNYSPPGSPI EKYQYPLFGL PFVHNDPQSB ADWLRFWSKY KLSVPGNPHY 1 HO 
LSHVPGLPNP CQNYVPYPTF NLPPHFSA VG SDNDIPLDLA DCHSRPGPTA NGASXEJOKA 1200 
PPNVKNEGPL NWKTEKVDR STQDELSTKC VHCGIVFLDE VMY ALHMSCH GDSGPFQCSI 1260 
CQHLCTDKYD FTTHIQRGLH RNNAQVEKNG KPKE 

30 Seq ©NO: 69 DNA sequence 

NocJek Acid Accession #: XM 073879 - 

Ctodfadj seqiitnce; M87(uDdtrimed 

1 11 21 31 41 5t 

ATGGGGTTTG GAGACCAGGG AACGGTGGAA GGGAGCCTAG OAACGTOGAA AAAACCACCT 60 
GAAGTGAAAA TOTTTCG AGC CAGTCAAGGT TTGCTGACAA TGGAAACAAA CCAGTCCCTG 120 
GCACAAGGCA CAGGCTGCTC AGTAGTAAAG GTAGACACTG TTCTCTTCGA GAGTTTATAC 180 
CACTGCGGCT TTGAACAOGG GAGOGTGATG CACTGCCTTG GGGATOATCA CCCCCAGQAA 240 
40 GACAGGAAGG CTCACTTCTC TGCCCCAGTT GCAGCCATCG CCTCTCCAGC AOCGACTCCT 300 
GTCTCTCCTO CACACCACTC AACACAG AGC ATCTGOCAGT TTCTACAGCA CTGCAGGCAG 360 
AACACTCACT TGCAGGCTGC TAA CTAA 



Seq ED NO: 70 Proteins 

45 Proteb Accession 9t XP_073879 

1 11 21 31 41 51 

MGFGDQGTVE GSLGTSKKPP EVKMFG ASQG LLTMETNQSL AQGTGCSWK VDTVIJESLY 60 
50 HCGFEHGSVM HCLGDDHPQB DRKAHFSAPV AAIASPAPTP VCPAHHSTQS ICOFLQHCRQ 120 
NTHLQAAN 



ScqIDNO:71DNAs 

NocJefeAcid Accession ft AB033064 
55 sequence: 826-1986 (imderiiDedseqiiei^ 

1 11 21 31 41 51 

GGGGAACA GC aLtTCAGTC ACAGACAATC CTCCACTCGG TCAAGAGCCA CTnTCTOT 60 

60 CCTGCCITGC CCCCCCGCAG GGGGTAAGGA ACTGAGCGTT TAATCTTTAG CCGGTTGGCT 120 
ACCAGCTAAA ATTCTACTTA TCTTAGTTTC TAGTGGATAG CTTTCTTATT TTGCCCATGT 180 
TTTCTTAGAA TCCCTGTTTA ATATACTTTT GTCAGTAGTA GTATCTAGGA GTAOCAGGGA 240 
GAGTGACAAT AAATTAGCCC U 1 UH11I1 CCCTTGTCAT TCAGGCCCCT TTTCCTCJCC 300 
AG AGGQAAAT TACCAGTAAA CTCTTCTAAA TCTTCCACCC CTTCTCAGTC ATACTGTGAA 360 

65 GAAACACACT AAAGTGGACA TTATTTGACC AGTGAACACG AACCCAGCTT CAGGCATTGG 420 
TTTGTTGTGG CACATGGAGA AACATCTCTT TTAAAATATC TOCCAATTAC CCTTTTCACA 480 
ATTTGTATOC ACCTAGG ATT TGCTGCTGGG GTAAGTCACT AGATTTATIT CTCAAAGCTC 540 
CCCTCrcrAT GAOCTG AAAG ACTGACCAAC CATG AACACT AGTAG GCGAT GGOQAAAGGG 600 
G ACAGAGCAG AGCCAXjTTGT TCCACACTTT GGGAAGCAGG AGTAGCTTTT ATCATCTTCC 660 

70 tctggggagc AGGCATAGAG ACATAAACTG agtgaaaatg GGTGOAGGAA GAACTTCTAT 720 
ACCCACGAAC AACATGTGAA GAGAGAGAAC CAAACATAAA GTAAGGAGGG TAGACGTTAC 780 
ATCCAAGAGG AAATAATCCA GGCAAGGAAG CACAAGCTGA TCAAGATGTG TAGTTCTGTG 840 
GCTGCCAAGT TGTGGTTTTT GACAGATCGT CGCATCAGGG AAGACTATOC TCAAAAAGAG 900 
ATTTTACGAG CATTG AAGGC CAAATGTTGT GAGGAGGAAC TGGACTTTAG GGCTGTGGTG 960 

75 ATGG ATCAGG TGGTGCTGAC AATCGAGCAA GGAAAOCTGG GTCTGCGGAT CAATGGAGAG 1020 
CTAATCACTQ CCTACOCACA AGTGGTGGTA GTCAG AGTAC CAACCCCTTG GGTGCAAAGT 1080 
GATAGTGACA TCACTGTTTT GCGCCATCTA GAGAAGATGG GATGTCGGTT AATGAACCGA 1 140 
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CCTCAAGCCA TCCTGAACTG CGTTAATAAG TTCTGGACAT TTCAAGAGTT GGCTGGCCAT 1200 

GGTGTTCCTC TGCCGGATAC TTTCTCTTAT GGTGGCCACO AAAATTTTGC TAAAATGATT 1260 

GATGAGOCTO AAGTTCTGOA GTTCCCAATG OTAGTAAAOA ATACGCOGOQ TCACAGAGGT 1320 

AAAGCTGTTT TCTTGGCTCG AGATAAGCAC CATTTGGCTG ATCTAAGCCA TCITATTCGC 1380 
5 CATGAAGCGC CATACCTGTT CCAGAAGTAT GTTAAAGAGT CTCATGGACG GGATGTACGT 1440 

GTCATTGTCG TGGGAGGCCG TGTGGTTGGC ACCATGTTAC GTTGTTCAAC AGATGOGAGA 1500 

ATGCAAAGCA ACTGCTCATT AGGTGGTGTG GGGATGATGT GCTCATTGAG TGAACAAGGG 1560 

AAGCAGCTAG CTATCCAGGT GTCTAATATC CTGGGGATGG ATGTGTGTGG CA TTOAC CTO 1620 

CTGATGAAAG ATGACGGCTC CTTCTGCGTC TGTGAGGCCA ATGCAAATGT AGGTTTCATC 1680 
10 GOCTTTGATA AGGCTTGTAA TCTAGATGTA GCTGGTATCA TAGCAGACTA TGCCGCCTCC 1740 

CrrCTACCCT CTGGCCGGCT CACCCGGCGT ATGTCCCTGC TCTCCGTGGT GTCCACTGCC 1800 

AGTGAGACTA GTGAGOOGGA GCTGGGTOOC CCAGCCAGCA CTGCTGTTGA CAACATGAGT 1860 

GCAAGTTCCA GCTCTGTIGA CAGCGACCCTGAAAGCACGG AGCGAGAGCT GCTCACCAAG 1920 

CTOCCAGGGQ OCCTOTTCAA CATG AACCAG CTGCTAQCCA ATOAAATCAA ACTACTGGTG 1980 
15 GACTGACTCC ACTGGTAATT AACCAACAAA ACCCITGTAA AACJ11U11T CTTCnTTCT 2040 

ATTTTTAAAA CCAACTTGCA ATGCTGTTCA TGGAGGATGC TCAGGAAG AT GAGAGAAAAT 2100 

TAGTAGGATT AGTTGGAGAG AGTGGGAGAT AGATGAGACC TCTGCTAGTA AGATGTTACT 2160 

TTCATTTACA AATOCTACAA ATAG AO AGGC AG AATAGGTO GGGTATAG AA AAATGTCAGG 2220 

CTCTCATAGT TACCCTTTTA AATTGCTAAA AAATGTGTAT GCTCATAGGC CATGAGGAAC 2280 
20 AAATA C till IHU1 IC AT GGTCCCTTGC TIT1UI 1111 GTACAA AAAA AAATGGTTTT 2340 

GCTACAAATA TCCAAGTAGC ATAACTTCAC ATTGTGTTGG AAGATTTOTC ATCAGTGAGG 2400 

AAAACATCTG CTTAAATTAC AGGAATTTTT GTATT ATACA GCTCTGAAAA TTCTGCCATT 2460 

TCCTTATTAA CTAGCAGCTT TAGTTTGTAG TTTATGAAAT CTTGAGGGGC TCTTTTACTG 2520 

GGATTTCTTA TTTTTTTGTT TTTTCCCGCT TAATTTGGTG GGAGGTCAAA TTGAATATAA 2580 
25 CCCAATAAAG GCTTCTTAAT GACAAAATTG GCATGTTTGC ATOATGAAAT GGAAATGAAC 2640 

AGTATTGCAA TGTOCGGTAT ACAAAATAAC ATTAATTCAA TGTAGATAAA ATTACACTAG 2700 

TTTAAAATAT GTGCATTCAC TTGTATTTGT TAGTGTTTTA GTCTTTTTTG AAAGATGTGC 2760 

TCTGTTAATG TTG C1 11111 1111111111 TAATACATGC TAGTCTAACA TTTCCTGCTC 2820 

TATGCCTGCA TCTTTAACAA TGGCCAAAGT GAAGAAAATG CTACCTTTTT TGTTA ACAAG 2880 
30 ACACTGACTT GAAACATGTA CATTTAAAGC CTTTTATTTT TTCCCTTTTT GTTTTGGTAG 2940 

TTGGGCATTT AAATAAGGAC AAGGAAAAAT ATTTTTGGGG GCAAATCAAG AGCCTATGAG 3000 

TTCTAAGTAT AAAGCTGAAG TGATTTCOAA TGCCAGCGTT ATATATTTGC AaaaaauACA 3060 

TTTTACGAGG GAGTATATGT GTATGTGTGT GCACGCATGC ATGTGTATGT GTTTTGCTTT 3120 

TTGTTTCCAT CAACTAATCA AAAAGGATAA TTTAGAAAAT GGAGCATG AT GGGAAACAGA 3180 
35 GTTTTTGACT TTAAAAAACA GATGAGTTGT TTTCATAAGT AG ACTCCACT GGGGTAGAGG 3240 

TATTCACCTT AAAACATAGG GTQAGTAGAT GCTTTTTTAG GCCTTTTTGT GTATATGTAC 3300 

GllUlllOU 1111 iCC 1 1 1 TCTTTCTAOC CTGTTCAGTG TACAGTTTAT TCAAGGCTAC 3360 

ATGCTTTTCTTTAATGCTTC TGGCTATGCA 1 1 1 1U1LU 1 TTACATATAO GATTTQOGAT 3420 

TGGGGGTGGG TTGGATGTTT TTGTTTGGGG ACTTATTTAG TAGTATTGAG TCTCTTATAG 3480 
40 CCCTACTCTT AAGCCTTCAA TACTGTOCAC TCTTTATATT CCTTTACTTG CAGAATTTAT 3540 

AAAAGCCCCC AAACTGCATA TAATATGAGC CTTTAAAACA TGGGTAAAAC TAATCCCATT 3600 

GATOGGTTTG GATGGTATOT TAAGAAATOG AGATGCTGCA GAGCCCAACG TAATTTTTTA 3660 

AACAGCAAGT TTTCCATCTC CCTACGAATC CTCTGAAGCT TTTACCCAAG CCCTTTCTTG 3720 

CCTCTCCAGT GCTATTTTCC TTCAGATGGA CCTTAAACAT AATTTCTTGG ACACTACTAG 3780 
45 AGAGACTTCG AGGCAATAAT AAAAGATCAG TATTAACCAG CTATAACAGA GGTTTG ATC A 3840 

TGCTTACTTG TACAGTTTTT CCCCCGTTTT AAAAAGOAAT G TAATA AAAT TTGTTTTTTC 3900 

CATAGAATTA AATAATATTA AAATTGAGTG AAAGGTTGAT TGTTGATGAA TAGAATAGTA 3960 

CCTCTCATCT GTGCAGTGTC TCATTTCACC TCAGAGAAAA GOATACATAA GAGGAGTTTG 4020 

TAATTTATCT TAGGATATTC TAATTGCATT TAAAAGAACT TATCTTGCGC AGGGTAAATG 4080 
50 GGGGACTCAC ATACATATAT TAATACCTCT GACTCATTAA CAGAAAGAAA TACTTGGTAC 4140 

TTCTTTOGCT GAATGACCAT ACTGTGG AGG ATGCATACTA TTTGGTATAG AGAAATAAAT 4200 

GAGO AAOAAA GAACTGCTTA ATTAAATTAT CATTCATATG TTCATATAO A OACCATCTOO 4260 

TTGCCATGTG TATTATGACA CATACACTTT GAATAGTTAC ATATCACAAG TATGTAGTTC 4320 

ATGTTTGTGT TGGTGGGGTA AGGCATCAGG AAAAATGTAG TTAGTCTTTT CTTAACTTAT 4380 
55 ACCAAATTAA CCAACTATAT TATAGG AAAT ATGTGAAATT AGTTCATTAG CTTTATTCAC 4440 

TATTATCCAT TCACATGATA TTAAAACGTA CACTCACATG TTAGAATGAA AAGAGCAGTA 4500 

GTTATCTTAG ATTTTAAAAA CATGGATATC TTCTTGAATT CCTTCAAGAT TGAGGTAG AG 4560 

AATAAGAGCA AATCATTCTG GAAGTACCTT AAGOAAACAA ACAGCAGCAG ATATTTAGGT 4620 

TAAACTTATT TTCATAATTG TTTAATAACT TTTGTATAAT CTTCATTQCT A TTATGA OAO 4680 
60 AGAATGTATA TATCAAATAT GTGTAATGAT AAAATCTGAA TTGTAAAATT TTTGTA TATT 4740 

GTTAAAATTG TAATTCTAAA TTGTATTTCA AAAATGATTA TTTCTGATAT TGTTTTTATO 4800 

TCAOCCATG A TGAAAACTGG ACTTTATATA TCTAAACATA CAAGTATOAA CTATTC TATT 4860 

TAAAATTTTT AATAU Hill 1 111 1 1 1 1 10 GTGOCTATAATTGATTGGTC Aa a iCiuCTG 4920 

GCTTTTCTOC AATGAACATT GAAATCTTCC TGTATATGTT ACCAATAAGA AAACTA CCCT 4 980 
65 GGAACAGTAO AAAAACCCAA CAAGAGACTT OGCATTCAT C AAGC ACATTA TCAGACTTTG 5040 

AGAACATATT GAAGOCATTO ACTTTGAAAA TCA1U1UH 1 1 1C1CAAGAA GAAAGCAATO 5100 

GAGAAGCAAA TTTGTTTCAT TCAGTGAATC CCCAGTTTGG GGCTTGTGGG GCTTAGAGAC 5160 

ATTGTGAAAT CAAATCTTGT GTTATACTTT TCTCCTGGCT CACTTTTTTT G AGAAGGTTT 5220 

ATGGOCTATT TGGCTGGTGA GACACGATCC CCTCCTAAOA AAATGTAQOT OCTCAGACAO 5280 
70 GTAACCACTG CTGCTACTOT TTTTATTTGT TTGTTTGTTC AATTTTATTT AAGATTTGTT 5340 

TTTGTTGTAC TAGGATTTTA AAAAATGTAA TATATTGCAG GATTTATAAC CAG ^ 

Keg ID NO: 72 Protein sequence: 
Protein Accession BAA86552 

75 

1 11 21 31 41 51 
| I I I 1 I 
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MCSSVAAKLW FLTDRRKED YPQKEILRAL KAKCCEEELD FRA WMDEW LTIEQGNLGL 60 
RINGEUTA Y PQWWRVPT PWVQSDSDIT VLRHLEKMCC RLMNRPQAIL NCVNKFWTFQ 120 
ELAOHGVPLP DTFSYGGHEN FAKM1DEAEV LEFPMWKNT RGHRGKAVFL ARDKHHLADL 180 
SHLIRHBAPY LFQKYVKESH GRDVRVIWG GRWGTMLRC STDGRMQSNC SLGGVGMMCS 240 
LSEQGKOLAI OVSNILGMDV CGIDLLMKPD GSFCVCEANA NVGFIAFDKA CNLDVAGJA 300 
DYAASLLPSG RLTRRMSLLS WSTASETSE PELGPPASTA VDNMSASSSS VDSDPESTER 360 
ELLTKLPGGL FNMNQLLANE IK1XVD 

Seq ID NO: 73 UNA sequence 

Nucleic Acid Accession XM_040080.2 

Coding sequence: 159-1 1M (underlined sequeiKes contspo^ to start and stop cwkms) 
1 11 21 31 41 51 

CTGAGTGGGG GCGGGGACTG CrGGAGTTGC GGGGCCTGCC TGGGGTAGGG CGGGGCAGGA 60 
CAGCTTGGAG ATAGGGCOOG G AATTGCGOQ CGTCACTCTC CTCCTGCGAC CTAGOCAGGC 120 
GTOAGGGAGT GACAGCAGCG CATTCGCGGG ACGAGAGC QA TGA GTGAGAA CGCCGCACCA 180 
GGTCTGATCT CAGAGCTGAA GCTGGCTGTG CCCTGGGGCC ACATCGCAGC CAAAGCCTGG 240 
GGCICCCTGC AGGGCCCTCC AGTTCTCTGC CTGCACGGCT GGCTGG ACAA TGCCAGCTCC 300 
TTOGACAOAC TCATCOCTCT TCTCCCGCAA GACTTTTATT ACQTTOCCAT OG ATTTCG G A 360 
GGTCATGGGC TCTCGTCCCA TTACAGCCCA GGTGTCCCAT ATTACCTCCA GACTTTTGTG 420 
AGTGAG ATCC GAAGAGTTGT GGCAGCCTTG AAATGGAATC GATTCTCCAT TCTGGGCCAC 480 
AGCTTCGGTG GCGTCGTGGG CGGAATGTTT TTCTGTACCT TCCCCGAGAT GGTGGATAAA 540 
CTTATCTTGC TGGACACGCC GCIX7TTTCTC CTGGAATCAG ATGAAATGG A GAACTTGCTG 600 
ACCTACAAGC GGAGAGCCAT AGAGCACGTG CTGCAGGTAG AGGCCTCCCA GGAGCCCTCG 660 
CACGTGTTCA GCCTGAAGCA GCTGCTGCAG AGGTTACTGA AGAGCAATAG CCACITGAGT 720 
GAGGAGTGCG GGGAGCTTCT CCTGCAAAGA GGAACCACGA AGGTGGCCAC AGGTCTGGTT 780 
CTGAACAGAG ACCAGAGGCT CGCCTGGGCA GAGAACAGCA TTGACTTCAT CAGCAGGGAG 840 
CTGTGTGCGC ATTCCATCAG GAAGCTGCAG GCCCATGTCC TGTTGATCAA AGCAGTCCAC 900 
GGATATTTTG ATTCAAGACA GAATTACTCT GAGAAGGAGT CCCltflCGIT CATGATAGAC 960 
AOGATGAAAT CCACCCTCAA AGAGCAGTTC CAGrTTTGTGG AAGTCCCAGG CAATCACTGT 1020 
GTCCACATGA GCGAACCCCA GCACGTGGCC AGTATCATCA GCTCCTTCTT ACAGTGCACA 1080 
CACATGCTCC CAGCCCAGCT GTAGCTCrGG GCCTGGAACT ATGAAGACCT AGTGCTCCCA 1140 
GACTCAACAC TGGGACTCTG AGTTCCTGAO CCCCACAACA AGGCCAGGGA TGGTGGGQ AC 1200 
AGGCCTCACT AOTCTTGAGO CCCAOCCTAO GATGGTAGTC AOOOOAAGGA GCGAGATTCC 1260 
AACTTCAACA TCTGTGACCT CAAGGGGGAG ACAGAGTCTG GGTTCCAGGG CTGCTTTCTC 1320 
CTGGCTAATA ATAAATATCC AGCCAGCTGG AGGAA GG AAO GGCAGGCTGG GCCCACCTAG 1380 
CCTTTCCCTG CTGCCCAACT GGATGGAAAA TAAAAGGTTC TTGTATTCTC A 

Seg ID NO: 74 Protein seqaence: 
Protein Accession XPjMOOSai 

1 11 21 31 41 51 

MSENAAPOLI SELKLAVPWO HTAAKA WGSL QGPPVLCLHO WLDNASSFDR LIPLLPQDFY 60 
YVAMDFGGHG LSSHYSPGVP YYLQTFVSEI RRWAALKWN RFSHjGHSFG GWGGMFFCT 120 
FPEMVDKLIL LDTPLFLLES DEMENLLTYK RRAIEHVLQV EASQBPSHVF SLKQLLQRLL 180 
RSNSHLSEEC GELLLQRGTT KVATGLVLNR DQRLAWAENS IDF1SRELCA HSIRKLQAHV 240 
LUKAVHGYF DSRQNYSEKE SLSFMIDTMK STUCEQFQFV BVPGNHCVHM S3BPQHVASH 300 
SSPUQCIHMLPAQL 

Seq ID NO: 75 DNA sequence 

NvcJefc Acid Accession & NM_005794 

Coding sequence: 434-1276 (underlined sequences correspond to start tad stop codons) 
1 11 21 31 41 51 

G UT IUXI I C CAOOCngrGA AG ClTltil 1C TTTTGOTCTT CATGATAAAT CTTGuiuCIO 60 
CTCACTCGTT GGGTCCGTGC CACCTTTAAG AGCTGTAACA CTCACOGCGA AGGTCTGCAA 120 
CrrcACTCCT GGGGCCACiCA AGACCACG AA TGCACCGAGA GGAATGAACA ACTCTGGACA 180 
CACCATCTTT AAGAACCGTA ATACTCACCG CAAGGGTCTG CAACTTCATT CTTGAAGTCA 240 
GTGAGGCCAA GAACCCATCA ATTCCGTACA CATnTGGTG ACTTTGAAGA QACTGTCACC 300 
TATCACCAAG TGGTGAGACT ATTGCCAAGC AGTGAGACTA TTGCCAAGTG GTG AG ACCAT 360 
CACCAAGCGG TGAGACTATC ACCTATCGCC AAGTGGCCTG ATTCAGCAGG AAGCATCTCA 420 
GACACCAACC ACTATGCTGT CAGCAGTTGC CCGGGGCTAC CAGGGCTGGT TTCATCCCTG 480 
TGCTAGGCTT TCTGTGAGGA TG AGCAGCAC CGGGATAGAC AGGAAGGGCG TCCTGGCTAA 540 
CCGGGTAGCC GTGGTCACGG GGTCCACCAG TGGGATCGGC TTTGCCATCG CCCGACGTCT 600 
GGCCCGGGAC GGGGCCCACG TGGTCATCAG CAGCCGGAAG CAGCAGAACG TGG ACCGGGC 660 
CATGGCCAAG CTGCAGGGOG AGGGGCTOAG TGTGGCGGGC ATTGTGTGCC ACGTGGGGAA 720 
GGCTGAGGAC CGGGAGCAGC TGGTGGCCAA GGCCCTGGAG CACTGTGGGG GCX3TCGACTT 780 
CCTGGTGTGC AGCGCAGGGG TCAACCCTCT GGTAGGGAGC ACTCTGGGGA CCAGTGAGCA 840 
GATCTGGGAC AAGATOCTAA GTGTG AACGT GAAGTCCCCA GCOCTGCTGC TGAGCCAGTT 900 
GCTGCCCTAC ATGGAOAACA GGAGGGGTGC TGTCATCCTG GTCTCTTCCA TTGCAGCTTA 960 
TAATCCAGTA GTGGCGCtGG GTGTCTACAA TGTCAGCAAG ACAGCGCTGC TGGGTCTCAC 1020 
TAGAACACTG GCATTGGAGC TOGCCCCCAA GGACATCCGG GTAAACTGCG TGGTTCCAGG 1080 
AATTATAAAA ACTGACTTCA GCAAAGTGTT TCATGGGAAT GAGTCTCTCT GGAAGAACTT 1 140 
CAAGGAACAT CATCAGCTGC AGAGGATTGG GG AGTCAGAG GACTOTGCAG GAATCGTGTC 1200 
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CrrCCTGTGC TCTCCAGATG CCAGCTACGT CAACGGGQAO AACATTGCGG TOGCAGGCTA 1260 
CTCCACTCGG CTCTOAGAGG AGTGGGGGCG GCTGCOTAGC TGTGGTCCCA GCCCAGGAGC 1320 
CTGAGGGGOT GTCTAGGTGA TCATTTGQAT CTGGAGCAGA CTCTGCCATT CTGCCAGACT 13B0 
AGCAATTTGG GGGCTTACTC ATGCTAGGCT TGAGGAAGAA GAAAAACGCT TCGGCATTCT 1440 
5 CC 

ScqlDNO:76 Protein sequence; 
Protein Accession ft NPJD05785 

10 1 II 21 31 41 51 

MLSAVARGYQ GWFHPCARLS VRMSSTGIDR KGVLANRVAV VTGSTSGIGF A1ARRLARDG 60 
AHWKSRKQ QHVDRAMAKL QGEGLSVAGI VCHVGKAEDR EQLVAKALEH CGGVDFLVCS 120 
AGVNPLVGST LGTSEQIWDK ILSVNVKSPA LLLSQLLPYM ENRRGAVILV SSIAAYNPW 180 
15 ALGVYNVSKT ALLGLTRTLA LELAPKDIRV NCWPGDKT DFSKVFHGNB SLWKNFKEHH 240 
QLQRIGESED CAGJVSFLCS PDASYVNGEN IAVAGYSTRL 



SeqIDNO:77 DNAs , 
20 Niidek Acid Accession/?: NM_002425 

Coding sequence: 26-1453 (imdertmcd sequences conespood to start and stop codons) 

1 It 21 31 41 51 

25 AAAGAAGGTA AGGGCAGTGA GAATGATGCA TCTTGCATTC CrTGTGCTGT TGTGTCTGCC 60 
AGTCTGCTCT GCCTATCCTC TG AGTGGGGC AGCAAAAGAG GAGGACTCCA ACAAGGATCT 120 
TOCCCAGCAA TACCTAG AAA AGTACTACAA CCTCGAAAAG OATQTGAAAC AGTTTAGAAG 180 
AAAGGACAGT AATCTCATTG TTAAAAAAAT CCAAGGAATG CAGAAGTTCC TTGGGTTGG A 240 
GGTGACAGGG AAGCTAGACA CTGACACTCT GGAGGTGATG CGCAAGCCCA GGTGTGGAGT 300 
30 TCCTGACGTT GGTCACTTCA GCTCCTTTCC TGQCATOCCO AAOTQO AGGA AAACCCACCT 360 
TACATACAGG ATTGTGAATT ATACACCAGA TTTGCCAAGA GATGCTGTTG ATTCTCCCAT 420 
TG AGAAAGCT CTGAAAGTCT GGGAAGAGGT GACTCCACTC ACATTCTCCA GGCTGTATGA 480 
AGGAGAGGCT GATATAATG A TCTCTTTCGC AGTTAAAGAA CATGGAGACT TTTACTCTTT 540 
TGATGGCCCA GGACACAGTT TGGCTCATGC CTAOCCAOCT GGACCTGGGC TTTATGGAGA 600 
35 TATTCACTTT GATGATG ATG AAAAATGGAC AGAAGATGCA TCAGGCACCA ATTTATTCCT 660 
COTTGCTGCT CATGAACTTG GCCACTCCCT GGGGCTCTTT CACTCAGCCA ACACTGAAGC 720 
TTTGATGTAC OCACTCTACA ACTCATTCAC AG AGCTCGCC CAGTTCCGCC TTTOGCAAGA 780 
TG ATGTOAAT GGCATTCAGT CTCTCTACGG ACCTCCCCCT GOCTCTACTG AGGAAC CCCT 840 
GGTGCCCACA AAATCTGTTC CTTCGGGATC TG AGATGCCA GCCAAGTGTG ATCCTGCTTT 900 
40 GTCCTTCGAT GCCATCAGCA CTCTGAGGGG AGAATATCTG TTCTTTAAAG A CAGATA TTT 960 
TTGGCOAAGA TCCCACTGGA ACCCTGAACC TGAATTTCAT TTOATTTCTG CA TTTTGGC C 1020 
CTCTCTTCCA TCATATTTGG ATGCTGCATA TGAAGTTAAC AGCAGGGACA CCGTTTTTAT 1080 
TTTTAAAGGA AATGAGTTCT GGGOCATCAO AGGAAATGAG GTACAAGCA O GTTATCCAAG 1140 
AGGCATCCAT ACXXTTGGGTT TTCCTCCAAC CATAAGGAAA ATTGATGCAG CTGTTTCTGA 1200 
45 CAAGGAAAAG AAG AAAACAT ACTICTTTGC AGCGGACAAA TACT GGAGA T TTGATGAAAA 1260 
TAGCCAGTCC ATCGAGCAAG GCTTCCCTAG ACTAATAGCT GATGACTTTC CAGGAGTTGA 1320 
GCCTAAGGTT GATGCTGTAT TACAGGCATT TGGATTTTTC TACTTCTTCA GTGGATCATC 1380 
ACAGTTTGAG TTTGACCCCA ATGCCAGGAT GGTGACACAC AT ATTAAA GA GTAACAGCTG 1440 
^ OTTACATTGC TAGGCGAQAT AGGOQGAAGA CAGATATGGG TGTTTTTAAT AAATCTAATA 1500 
50 ATTATTCATC TAATGTATTA TGAGCCAAAA TGGTTAATTT TTCCTGCATG TTCTGTGACT 1560 
GAAGAAGATG AGCCTTGCAG ATATCTGCAT GTGTCATGAA GAATGTTTCT GG AATTCTTC 1620 
A CTlUCni 1 GAATTGCACT G AACAGAATT AAOAAATACT CATGTOCAAT AGGTGAGAG A 1680 
ATGTATTTTC ATAGATGTGT TATTACTTCC TCAATAAAAA GTTTTATTTT GGGCCTGTTC 1740 
CTT 
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SeqIDNO:78 Protein seqnenee: 
Protein Accession*: NPJD02416 



1 11 21 31 41 51 

60 i i i i I i 

MHLAFLVLIjC LPVCSAYPLS GAAKEEDSNK DLAQQYLBKY YNUEKDVKQF RRKDSNLIVK 60 
KKJGMQKFUG LBVTGKLDTD TLEVMRKPRCGYPDVGHFSS FPGMPKWRKT HLTYRIVNYT 120 
PDLPRDAVDS AIEKALKVWE BVTPLTFSRL YEGEADIMIS FAVKEHGDFY SFDGPGHSLA 180 
HAYPPGPGLY GDIHFDDDEK WTEDASGTNL FLVAAHELOH SLGLFHSANT EALMYPLYNS 240 
65 FTELAQFRLS QDDVNGIQSL YGPPPASTHB PLVPTKSVPS GSEMPAKCDP ALSFDAKTL 300 

RGEYLFFKDR YFWRRSHWNP EPEFHUSAF WPSLPSYLDA AYHVNSRDTV FDFKGNEFWA 360 
IRGNEVQAGY PRGIHTLGFP PTIRKIDAAV SDKEKKKTYF FAADKYWRFD ENSQSMEQGF 420 
PRUADDFPO VEPKVDAVLQ AFGFFYFFSG SSQFEFDPNA RMYTHUJCSN SWLHC 

70 SeqmNa 79 DNA sequence 

Nocktt Arid Accession*: NMJD00493.1 

Cooing sequence: 1-2043 (underlined sequences correspond to start end stop codons) 

75 1 11 21 l 31 1 41 | 51 

ATOCTGCCAC AAATACCCTT TTTGCTGCTA GTATCCTTGA ACTTGGTTCA TGGAGTGTTT 60 
TACGCTGAAC GATACCAAAC GCCCACAGGC ATAAAAGGCC CACTACCCAA CACCAAGACA 120 
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CAGTTCTTCA TTCCCTACAC CATAAAGAGT AAAGGTATAG CAGTAAGAGG AGAGCAAGGT 180 
ACTCCTGGTC CACCAGGCCC TGCTGGACCT CGAGGGCAXX CAGGTCCTTC TGGACCACCA 240 
GG AAAACCAG GCTAOGG AAO TCCTGGACTC CAAGGAQAGC CAGGGTTGCC AGGACCACOG 300 
GGAOCATCAO CTGTAGGGAA ACCAGOTGTG CCAGGACTCC CAGGAAAACC AGGAGAGAGA 360 
GGACCATATG GACCAAAAGG AGATGTTGGA CCAGCTGGCC TACCAGGACC CCGGGGCCCA 420 
CCAGGACCAC CTGGAATCCC TGG ACCGGCT GGAATTTCTG TGCCAGGAAA ACCTGGACAA 480 
CAGGGACCCA CAGGAGCCCC AGGACCCAGG GGCTTTCCTG GAGAAAAGGG TGCACCAGGA 540 
GTCCCTOGTA TGAATGGACA GAAAGGGGAA ATGGGATATG GTGCTCCTOG TCGTOCAGGT 600 
GAGAGGGOTC TTCCAGGOCC TCAGGGTCCC ACAGGACCAT CTGGCCCTCC TGG AGTGG GA 660 
AAAAGAGGTG AAAATGGGGTTOCAGGACAG CCAGGCATCA AAGGTGATAG AGGTTTTCCG 720 
GOAGAAATGG GACCAATTGO OX^CCAGGTCCCCAAGGCC CTCCTGGGGA ACGAGGGCCA 780 
GAAGGCATTG GAAAGCCAOG AGCTGCTGGA CCCCCAGGCC AGOCAGGOA T TCCAGGAACA 840 
AAAGGTCTCC CTGGGGCTCC AGGAATAGCT GGGCCCCCAG GGCCTOCTGG CTTTGGGAAA 900 
CCAGGCTTGC CAGGCCTGAA GGGAGAAAGA GGACCTGCTG GCCTTCCTGO GGGTtXAGGT 960 
GCCAAAGGGG AACAAGGGOC AGCAGGTCTT CCTGGGAAGC CAGGTCTGAC TGGACCCOCT 1020 
GGGAATA7GG GACCCCAAGG ACCAAAAGGC ATCCCGGGTA GOCATGGTCT CCCAGGCCCT 1080 
AAAGGTGAGA CAGGGCCAGC TGGGCCTGCA GGATACCCTG GGGCTAAGGG TG AAAGGGGT 1140 
TCCCCTGGGT CAGATGGAAA ACCAGGGTAC CCAGGAAAAC CAGGTCTCGA TGGTCCTAAG 1200 
GGTAACCCAO GGTTAOCAGG TCCAAAAGGT GATOCTGGAG TTGGAGGACC TCCTGGTCTC 1260 
CCAGGCCCTG TGGGCCCAGC AGGAGCAAAG GGAATGCCCG GACACAATGG AGAGGCTGGC 1320 
CCAAGAGGTG CCCCTGGAAT ACCAGGTACT AGAGGOCCTA TTGGGCCACC AGGCATTCCA 1380 
GGATTCCCTO GGTCTAAAGG GGATCCAGGA AGTCOCGGTC CTCCTOGCCC AGCTGGCATA 1440 
GCAACTAAGG GCCTCAATGG ACOCACCGGG CCAOCAGGGC CTCCAGGTOC AAGAGGCCCC 1500 
TCTGGAGAGC CTGGTCTTCC AGGGCCCCCT GGGCCTCCAG GCCCACCAGG TCAAGCAGTC 1560 
ATGCCTGAGO GTTTTATAAA GGCAGGCCAA AGCCCCAGTC TTTCTGGGAC CCCTL miTT 1620 
AOTGCCAAOC AGGGOGTAAC AGOAATGCCT GTGTCTOCTT TTACTGTTAT TCTCTCCAAA 1680 
GCTTACCCAC CAATAGGAAC TCCCATACCA TTTOATAAAA TTTTGTATAA CAGGCAACAG 1740 
CATTATGACC CAAGGACTGG AATCTTTACT TGTCAGATAC CAGGAATATA CTATTTTTCA 1800 
TACCACGTGC ATGTGAAAGG GACTCATGTT TGGGTAGGCC TGTATAAGAA TGGCACCCCT 1860 
GTAATGTACA (XTATGATQA ATACACCAAA GGCTACCTGG ATCAGGCTTC AGGGAGTGCC 1920 
ATCATCG ATC TCACAGAAAA TGACCAGGTG TGGCTCCAGC TTCCCAATGC CGAGTCAAAT 1980 
GGCCTATACT OCTCTGAGTA TGTCCACTOC TCTTTCTCAG GATTOCTAGT GGCTCCAATG 2040 
TGAGTACACC CCACAG AGCT AATCTAAATC TTGTGCTAGA AAAAGCA7TC TCTAACTCTA 2100 
CCCCACCCTA CAAAATGCAT ATGGAGGTAG GCTGAAAAGA ATGTAATTTr TATTTTCTGA 2160 
AATACAGATT TGAGCTATCA GACCAACAAA CCTTCCCCCT GAAAAGTGAO CAO CAACGTA 2220 
AAAACGTATG TGAAGCCTCT CTTGAATTTC TAGTTAGCAA TCTTAAGGCT CTTTAAGGTT 2280 
TTCTCCAATA TTAAAAAATA TCACCAAAGA AGTCCTGCTA TGTTAAAAAC AAACAACAAA 2340 
AAACAAAGCA ACAAAAAAAA AAATTAAAAA AAAAAACAGA AATAG AGCTC TAAGTTATQT 2400 
GAAATTTGAT TTGAGAAACT CGGCATTTCC TTTTTAAAAA AGCCTGTTTC TAACTATOAA 2460 
TATCAG AACT TCTAGGAAAC ATCCAGGAGG TATCATATAA CTTTGTAGAA CTTAAATACT 2520 
TGAATATTCA AATTTAAAAG ACACTGTATC CCCTAAAATA TTTCTOATGG TGCACTACTC 2580 
TGAGGCCTGT ATGGCCCCTT TCATCAATAT CTATTCAAAT ATACAGGTGC ATATATACTT 2640 
GTTAAAGCTC TTATATAAAA AAGOCCCAAA ATATTGAAGT TCATCTGAAA TGCAAGGTGG 2700 
TTTCATCAAT GAAC C1U1C AAAACTTTTC TATGATTGCA G AGAAGCTTT TTATATACOC 2760 
AGCATAACTT GGAAACAGGT ATCTG ACCTA TTCTTATTTA GTTAACACAA GTOTGATTAA 2820 
TTTGATTTCT TTAATTCCTT ATTGAATCTT ATGTG ATATG ATTTTCTGGA TTTACAGAAC 2880 
ATTAGCACAT GTACCTTGTG CCTCCCATTC AAGTGAAGTT ATAATTTACA CTGAGGGTTT 2940 
CAAAATTCGA CTAGAAGTGG AGATATATTA TTTATTTATG CACTGTACTO TATT TTTATA 3000 
TTGCTGTTTA AAAdTTTAA GCTOTGCCTC ACTTATTAAA GCACAAAATO TTT TACCT AC 3060 
TOCTTATTTA CGACACAATA AAATAACATC AATAGATTTT TAGGCTGAAT TAATTTGAAA 3120 
GCAGCAATTT GCTGTTCrCA ACCATTCTCT CAAGGCTTTT CATTOGACAC AATAAAATAA 3180 
CATCAATAGA TTTTTAGG 

^IDNGtSOPrnteinseoocDCc; 
Protein Accesaonft NP_000484.I 



I 11 21 31 41 51 

MUK}IPFIaL VSLNLVHGVF YAERYQTPTG DCGPLPNTKT QFFIPYTIKS KG1AVRGEQG 60 
TPOPPGPAGP RGHPGPSGPP GKPGYGSPGL QGEPGLPGPP GPSAVGKPGV PGLPGKPGER 120 
GFYGPKGDVO PAGLPGPRG? PGFPG1PGPA GISVPGXPGQ QGPTGAPGPR GFPGEKGAPG 180 
VPGMNGQKGB MGYGAPGRPG ERGLPGPQGP TGPSGPPGVG KRGENGVPGQ PGBCGDRGFP 240 
OEMGPIGPPG PQGPPGERGP EG1GKPGAAG APGQPGIPGT KGLPGAPGIA GPPGPPGFGK 300 
PGLPGLKGER GPAGLPGGPG AKGEQGPAGL PGKPGLTCPP GNMGPQGPKG IPGSHGLPGP 360 
KGETGPAGPA GYPGAKGERG SPOSDGKPOY PGKPGLDGPK GNPGLPGPKO DPGVGGPPGL 420 
PGPVGPAGAK GMPGHNGEAG PRGAPGIPGT RGPIGPPGIP GFPGSKGDPG SPGPPGPAG1 480 
ATKG1NGPTG PPGPPGPRGP SGHPGLPGPP GPPGPPGQAV MPEGFECAGQ RPSLSGTPLV 540 
SANQGVTGMP VSAFTVTLSK AYPAIGTPIP FDKILYNRQQ HYDPRTGIFT CQIPGIYYFS 600 
YHVHVKGTHV WVGLYKNGTP VMYTYDEYTK GYLDQASGSA HDLTENDQV WLQLPNAESN 660 
GLYSSEYVHS SFSGFLVAPM 

SeqIDNO:81 DN A sequence 

Nucleic Acid Accession Si NM_000786 

Coding sequence: 332-1861 (underfilled sequences cwrespond to start and stop codons) 



I U 21 31 41 51 
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CGCGATTCTC AGGOATTOAT CCOCCTCTTC AGGTAAGTTA TCTTCCGOCC CCGTAOCACT €0 
GTGCCACAGG CGCAGCCCGC TTCCTCAGGT GCCCTATCCC GCGCAGAAGA CCACGGCTTC 120 
ACAGAGTGTT ATTTAAGGGC GTGGOCAGCG GAACATCCCG CCCCATTCTG TGACGCACGG 180 
GGTGGCGCGC GTGGGACCCG AGGGGTGGGQ CTGGGTTTAG TAGGAGACCT GGGGCAAGGC 240 
CCCCTGTGGA CGACCATCTG OCAGCTTCTC TCGTTCCGTC G ATTGGGAGG AGOGGTGGOG 300 
ACCTCGGCCT TCAGTGTTTC CGACGG AGTG AATPGCGGCG GCGGCTGGG A TGCTGCTGCT 360 
GGGCTTGCTG CAGGCGGGTG GGTCGGTGCT GGGCCAGGCG ATGGAGAAGG TGACAGGCGG 420 
CAACCTCTTG TCCATGCTGC TGATCGCCTG CGCCTTCACC CTCAGCCTGG TCTACCTGAT 480 
(XGTCTGGCC GCCGGCCACC TGGTCCAGCT GCCCGCAGGG GTGAAAAGTC CTCCATACAT 540 
TTTCTOOCCA ATTCCATTCC TTGGGCATGC CATAGCATTT GGGAAAAGTC CAATTGAATT 600 
TCTAGAAAAT GCATATGAG A AGTATGGACC TGTATTTAGT TTTAGCATGG TAGGCAAGAC 660 
ATTTACTTAC CTTCTGGGGA GTGATGCTGC TOCACTGCTT TTTAATAGTA AAAATGAAGA 720 
CCTG AATGCA G AAGATGTCT ACAGTCGCCT GACAACACCT GTGTTTGGG A AGGGAGTTG C 780 
ATACGATGTG OCTAATCCAG TTTTCTTGGA GCAGAAGAAA ATCTTAAAAA GTGGCCTTAA 840 
CATAOCOCAC TTTAAACAGC ATGTTTCTAT AATTGAAAAA GAAACAAAGG AATACT TTOA 900 
GAGTTGGGGA GAAAGTGG AG AAAAAAATGT GTTTGAAGCT CTTTCTGAGC TCATAATTTT 960 
AACAGCTAGC CATTGTTTGC ATGGAAAGG A AATCAGAAGT CAACTCAATG AAAAGGTAGC 1020 
ACAGCTOTAT GCAGATTTGG ATGGAGOTTT CAGOCATGCA GCCTGGCTCT TACCAG GTTO 1080 
GCTCC C1 1 1U CCTAGTTTCA GACGCAGGGA CAGAGCTCAT OGGGAAATCA AGGATATTTT 1140 
CTATAAGGCA ATCCAGAAAC GCAGACAGTC TCAAGAAAAA ATTGATGACA TTCTCCAAAC 1200 
TTTACTAGAT GCTACATACA AGGATGGGOG TCCTTTGACT GATGATGAAG TAGCAGGGAT 1260 
GCTTATTGGA TTACTCTTGG CAGGGCAGCA TACATCCTCA ACTACTAGTG CTTGGATGGG 1320 
CTTCTTTtTG GCCAGAGACA AAACACTTCA AAAAAAATGT TATTTAGAAC AGAAAACAGT 1380 
CTGTGGAGAG AATCTGCCTC CTTTAACTTA TGACCAGCTC AAGGATCTAA ATTTACTTGA 1440 
TCGCTGTATA AAAGAAACAT TAAGACTTAG ACCTCCTATA ATGATCATGA TGAGAATGGC 1500 
CAGAACTCCT CAGACTGTGG CAGGGTATAC CATTOCTCCA GGACATCAGG TGTGTGTTTC 1560 
TCCCACTGTC AATCAAAGAC TTAAAGACTC ATGGGTAGAA OGCCTGGACT TTAATOCTGA 1620 
TCGCTACTTA CAGGATAACC CAGCATCAGG GGAAAAGTTT GCCTATGTGC CATTTGGAGC 1680 
TGGGCGTCAT CGTTGTATTG GGGAAAATTT TGOCTATGTT CAAATTAAG A CAATTTGGTC 1740 
CACTATGCTT CGTTTATATG AATTTGATCT CATTGATGGA TACTTTCCCA CTGTGAATTA 1800 
TACAACTATG ATTCACACCC CTGAGAACOC AGTTATCOGT TACAAACX3AA G ATCAAAAJS I860 
AAAAAGGTTG CAAGGAACGA ATATATGTGA TTATCACTGT AAGCCACAAA GGCATTCGAA 1920 
GAGAATG AAG TGTACAAAAC AACTCTTGTA GTTTACTGTT TTTTTAAGTG TGTAATTCTA 1980 
AAAGCCAGTT TATGATTTAG GATTTTGTTA ACTGAATGGT TCTATCAAAT ATAATAGCAT 2040 
TTGACACATT TTCTAATAGT TATGATACTT ATACATGTGC TTTCAGGAAG TTCCTTGGTG 2100 
AAACAATTGT TGAGGGGGGA TCTAGGTAAT TGGCAGATTC TAAATAATAT AATTTCCAGA 2160 
TAGTAATTTT AAGAGTACTC ATCGCTCTTG CCAAATAAOT TCAGGGTATT CAAATCTTGG 2220 
ACT AGTCCTG CAAGGTATAA AGAATAAAAA TOCCAGTG AG ATACTTGG AA ACCACAGTTT 2280 
ATTATTATTT ATCTGGGCAA TTATTGTGTG TGTGAGGATG GAAGGGTAGG GAATAATCGA 2340 
ACATCTAAAG OCTTGAATAA GAGAATACTA ATTGTTTTGG TATGATGATA CTCAGAAATG 2400 
GAGATATTAT AGGAAAAAGA AATCCTTTGG AATTTTAACT AAAATCACTG CATATGGGAA 2460 
ATTAAGAGAT CCAGGACCAT ATTTGATAAG AGTTCCTAAA AATAATGTAA TTATTAATGC 2520 
TAAAGACTGC TCATGTATCT TOATCTAATT ACTAAATAAA TTACATATTT ATTTACCTGA 2580 
TAAATATGTA TCTAGTTCTA CAAGGTCACA TTTATGTGGA AGTCCAAAGT CAAGTCCTTA 2640 
GGGGATAATT T1U1 1 1 1GGG CTCAG'l 101 1 CCCr G CnCC 11 ITlli 1 1 1 1 1 111 1 1 1 U 2700 
TTGAGATGG A GTCTCGCTCT GTTGOCCAGG CTGGAGTGCA GTGGTGCGAT CTCAGCTCAC 2760 
TGCATCCTCT GCCTCCCGGG TTCAAGCAAT TCTCTGCCTC AGOCTCCCAA GTAGTTGGGA 2820 
TTACAGGCAC CTGCCAOCAT GCCTGGCTAA 11111 lU'lAT TTTTAGTAGA GACGGGGGTT 2880 
TCACTATGTT GGCTAGGCTG GTCTTG AACT CCTGAGOCTC GTGAGTCCAC CCGCCTTOGC 2940 
CTCCCAAAGT GCTOGGATTA CAGGCATGAG CCACCGCACC TGGCCTTCCC TGCTTOCTCT 3000 
CTAGAATCCA ATTAGGGATG TTTGTTACTA CTCATATTGA TTAAAACAGT TAACAAACTT 3060 
U HU 11 1 1 AAAATGTGAG ATCAGTGAAC 1C1 UbTl 1 1 A AGATAATCTG AAACAAGGTC 3120 
CTTGGGAGTA ATAAAATTGG TCACATTCTG TAAAGCACAT TCTGTTTAGG AATCAACTTA 3180 
TCTCAAATTG TAACTCGGGG CCTAACTATA TGAGATGGCT GAAAAAATA C CAC ATOGTCT 3240 
GTTTTCACTA GGTGATGOCA AAAT ATTTTG CTTTATGTAT ATTACAGTTC TTTTTAAAAC 3300 
ACTGGAAGAC TCATGTTAAA CTCTAATTGT GAAGGCAG AA TCTCTGCTAA TTTTTCAGAT 3360 
TAAAATTCTC TTTGAAAAAAT 

Scq ID Na 82 Protein sequence: 
Protein Acccsskm NP_000777 

1 U 21 3E 41 51 
I I I I I I 

MAAAAGMLLL GLLQAGGSVL GQAMEKVTGG NLLSMUIAC AFTLSLVYU RLAAGHLVQL 60 
PAGVKSPPYI FSPIPFLGHA IAFGKSPIHF LENA YEKYGP VFSFTMVGKT FTYLLGSD AA 120 
ALLFNSKNED LNABDVYSRL TTPVFGKGVA YDVPNPVPtE QKKMLKSGLN IAHFKQHVSI 180 
lEKHTKEYFE SWGESGEKNV FEALSELHL TASHCLHGKB IRSQLNEKVA QLYADLDGGF 240 
SHAAWLLPGW LPLPSFRRRD RAHREKDIF YKAIQKRRQS QHODDILQT LLDATYKDGR 300 
PLTDDHVAGM UGLLLAGQH TSSTTSAWMG FFLARDKTLQ KKCYLEQKTV CGENLPPLTY 360 
DQLKD11«XD RC1KETLRLR PPIMIMMRMA RTPQTVAGYT IPPGHQVCVS PTVNQRLKIK 420 
WVERLDFNPD RYLQDNPASG EKFAYVPFGA GRHRQGENF AYVQKTIWS TMLRLYEFDL 480 
IDGYFPTVNYTTMIHTPENP VIRYKRRSK 

Seq ID NOc 83 DNA sequence 

Nucleic Add Accession ff: NM_006551^ 

Coding sequence: 64-336 (underlined sequences conespoad to start aid stop codons) 
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PCT/US02/02242 



1 11 21 31 41 51 

AATTCTAOAA OTCCAAATCA CTCATTGTTT GTGAAAGCTQ AOCTCACAOC AAAACAAOCC 60 
5 ACCATO AAQC TQTCQGTQTQ TCT0CTOCTO OTCAOQCTGG COCTCTGCrO CTAOCAGOCC 120 
AATGCCG AGT TCTGCCCAGC TCTTC3TTTCT GAGCTGTTAG ACTTCTTCTT CATTAGTGAA 180 
CCTCTGTTCA AGTTAAGTCT TGCCAAATTT GATGCCCCTC CGGAAGCTGT TGCAGCCAAG 240 
TTAGGAGTGA AGAGATGCAC GGATCAGATG TCCCTTCAGA AACGAAGCCT CATTGCGGAA 300 
GTCCTGOTOA AAATATTGAA GAAATGTAGT GTGTGACATG TAAAAACTTT CATCCTGGTT 360 
10 TCCACIGTCI TTCAATGACA CCCTGATCIT CACTGCAGAA TGTAAAGGTT TCAACGTCTT 420 
GCTTTAATAA ATCACTTGCT CTAC 

Seg ID NO: 84 Protein sequence: 
Protein Accession tk HP_006542.I 

1 11 21 31 41 51 
I I 1 I I I 

MKLSVCLLLV TLALCCYQAN AEFCPALVSE LLDFFFISEP LFKLSLAKFD APPEAVAAKL 60 
GVKRCTDOMS LQKRSUAEV LVKILKKCSV 

ScqIDNO: 85 DNA sequence 
Nucleic AcM Accession ft NMJD02317.1 

Coding sequence: 231*1484 (rmderiinrri scqurnrrs correspond to start and stop codons) 
25 1 11 21 31 41 51 

GOGCCAGGAC TOAGAAAGGG GAAAGGGAAG GGTGCCACGT CCGAGCAGCC GCCTTGACTG 60 
GGGAAGGGTC TGAATCCCAC CCTTOGCATT GCTTGGTGGA G ACTOAGATA CCCGTGCTCC 120 
GCTCGCC1XJC TKJUi 1 0 AAG A l 1 ll ' lUCl T OCCTCAOGTG ATTTGAGCCC CQTTTTTATT 180 

30 TTCTGTGAGC CACGTCCTCC TCGAGCGGGG TCAATCTGGC AAAAGGAGTG ATGCGCTTCQ 240 
CCTGGACCGT GCTCCTGCTC GGGCCTTTGC AGCTCTGCGC GCTAGTGCAC TGCGCCCCTC 300 
CCGCCQCCGQ CCAACAGCAG CCCCCGCGCG AGCCGCCGGC GGCTCCGGGC GOCTGGCGCC 360 
AGCAGATCCA ATGGGAGAAC AACGGGCAGG TGTTCAGCTT GCTGAGCCTG GGCTCACAGT 420 
ACCAGCCrCA GCGCCGCCGG GACCCGGGCG CCGCCGTCCC TGGTGCAGCC AACGCCTCCG 480 

35 CCCAGCAGCC CXXXVlCTCCG ATCCTGCTGA TCCGCGACAA CCGCACCGCC GCGGGGCGAA 540 
CGCGGACGGC CGGCTCATCT GGAGTCA<XG CFGGCCGCCC CAGGCCCACC OCCCGTCACT 600 
GGTTCCAAGC TGGCTACTCG ACATCTAGAG CCCGCGAAGC TGGGCCCTCG CGCGCGG AGA 660 
ACCAGACAOC GCOGGGAGAA GI 1CC1GCTCTCAGTAAOCT GCGGCCGCCC AGOCGCGTGG 720 
ACGGCATGGT GGGCGACGAC OCTTACAACC CCTACAAGTA CTCTGACGAC AACCCTTATT 780 

40 ACAACTACTA CGATACTTAT GAAAGGCCCA GACCTGGGGG CAGGTACCGG CCOGGATACG 840 
GCACTGGCTA CTTOCAGTAC GGTCTOCCAG ACCTGGTGGC CGACCOCTAC TACATCCAGG 900 
CGTCCACGTA CGTGCAGAAG ATGTCCATGT ACAACCTGAG ATGCGCGGCG GAGGAAAACT 960 
GTCTGGCCAG TACAGCATAC AGGGCAGATG TCAGAG ATTA TGATCACAGG GTGCTGCTCA 1020 
GATTTCCCCA AAGAGTGAAA AACCAAGGGA CATCAGATTT CTTACCCAGC CGACCAAGAT 1080 

45 ATTCCTGGGA ATGGCACAGT TGTCATCAAC ATTACCACAG TATGGATG AG TTTAGCCACT 1140 
TGTACCTGCT TGATGCCAAC ACCCAGAGGA GATGGGCTGA AGGCCACAAA GCAAGTTTCT 1200 
GTCTTGAAGA CACATCCTGT GACTATGGCT ACCACAGGCG ATTTGCATGT ACTGCACACA 1260 
CACAGGGATT GAGTCCTGGC TGTTATGATA CCTATGGTGC AGACATAGAC TOCCAGTGGA 1320 
TTGATATTAC AOATGTAAAA CCTGGAAACT ATATCCTAAA GGTCA OTGTA AACOOCAGCr 1380 

50 ACCIGGTTCC TG AATCTOAC TATACCAACA ATGTTGTGCG CTGTGACATT CGCTACACAG 1440 
GACATCATOC GTATGCCTCA GGCTGCACAA TTTCAOCGTA TTAQAAGGCA AAGCAAAACT 1500 
COCAATGGAT AAATCAGTGC C1 GG IG1 1C I GAAGTGGGAA AAAATAGACT AACTTCAGTA 1560 
GGATTTATGT ATTTTGAAAA AGAGAACAGA AAACAACAAA AGAATTTTTO TTTGGACTGT 1620 
TTTCAATAAC AAAGCACATA ACTOOATTTT OAACOCTTAA OTCATCATTA CTTGOOAAAT 1680 

55 TTTTAATGTT TATTATITAC ATCACnTGT GAATTAACAC AGTGTTTCAA TTCTCTAATT 1740 
ACATATTTGA CTCTTTCAAA AAAAAAAAAA AAAAAAAAAA 



15 
20 



60 



SeqIDN0t86 Protem seqnence: 
Protein Accession ft NP 002308.1 



1 11 21 31 41 51 
1 1 I I I I 

MRFAWTVLLL GPLQLCALVH CAPPAAGQQQ PPREPPAAPG AWRQQIQWEN NGQVFSLLSL 60 
GSQYQPQRRR DPGAAVPGAA NASAQQPKTP ILURDKRTA AGRTRTAGSS GVTAGRPRPT 120 
65 ARHWFQAGYS TSRAREAGPS RAENQTAPGE VPALSNLRPP SRVDGMVGDD PYNPYKYSDD 180 
NPYYNYYDTY ERPRPGGRYR PGYGTGYFQY GLPDLVADPY YIQASTYVQK MSMYNLRCAA 240 
EHNCLASTAY RADVRDYDHR VLLRFPQRVK NQGTSDFLPS RPRYSWBWHS CHQHYHSMDE 300 
FSHLYLLDAN TQRRWAEGHK ASFCLEOTSC DYGYHRRFAC TAHTQGLSPG CYDTYGADID 360 
CQWIDITDVK PGNYILKVSV NPSYLVPESD YTNNWRCDI RYTGHHAYAS GCTBPY 



ScqtDNO-87 DNAs 
Nucleic Acid Accession ft NM 006419.1 

Coding sequence: 91-420 (underlined sequences correspond to start and stop codons) 

75 1 11 21 31 41 51 
| I I I I I 

TTCGGCACTT GOG AGAAGAT GTTTGAAAAA ACTGACTCTG CTAATGAGCC 7GG ACTCAGA 60 
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GCTCAAGTCT GAACTCTACC TCCAGACAGA ATQAAGTTCA TCTCGACATC TCTGCTTCTC 120 
ATGCTGCTGG TCAGCAGCCT CTCTCCAGTC CAAGGTGTTC TGGAGGTCTA TTACACAAGC 180 
TTGAGGTGTA OATGTGTCCA AGAGAGCTCA GTCTTTATCC CTAGACGCTT CATTGATCGA 240 
ATTCAAATCT TGCCCCGTGG GAATGGTTGT CCAAGAAAAG AAATCATAGT CTGGAAGAAG 300 
AACAAGTCAA TTGTGTGTGT GGACCCTCAA GCTGAATGGA TACAAAGAAT OATOQAAGTA 360 
TTGAGAAAAA GAAGTTCTTC AACTCTACCA GTTCCAGTGT TTAAGAGAAA G ATTCC CJJgA 420 
TGCTO ATATT TCCACTAAGA ACACCTGCAT TCTTCCCTTA TCCCTGCTCT GGATTTTAGT 480 
IIIUIUUIA OTTAAATCII 1 1CCAGGGAG AAAGAACTTC CCCATACAAA TAAGGCATOA 540 
GGACTATGTG AAAAATAACC TTGCAGOAQC TGATGOOGCA AACTCAAGCT TCTTCACTCA 600 
CAGCACCCTA TATACACTTG GAGTTTGCAT TCTTATTCAT CAGGGAGGAA AGT TTCTT TG 660 
AAAATAGTTA TTCAGTTATA AGTAATACAG GATTATTTTG ATTATATACT TGTTGTTTAA 720 
TGTTT AAAAT TTCTTAGAAA ACAATGGAAT GAGAATTTAA GCCTCAAATT TGAACATGTG 780 
GCTTGAATTA AGAAG AAAAT TATGGCATAT ATTAAAAGCA GGCTTCTATG AAAG ACTCAA 840 
AAAGCTOCCT GGGAGGCAGA TGGAACTTGA GCCTGTCAAG AGGCAAAGGA ATOCATGTAG 900 
TAGATATCCT CTGCTTAAAA ACTCACTAOG GAGGAGAATT AAGTCCTACT TTTAAAGAAT 960 
TTCTTTATAA AATTTACTGT CTAAGATTAA TAGCATTCGA AGATCCCCAG ACTTCATAGA 1020 
ATACTCAGGG AAAGCATTTA AAGGGTGATG TACACATGTA TCCTITCACA CATTTGCCTT 1080 
GACAAACTTCTnCACTCAC Alt;i llUl^ CroA C ' U Ul HOlO GGGGCGGGGCO G^GG 1140 
GGACTCTGGT ATCTAATTCT TTAATGATTC CTATAAATCT AATGACATTC AATAAAGTTG 1200 
AGCAAACATT TTACTT 

SeqIDNO: 88 Protcm sequence: 
Protein Accession ft NPJW6410.1 

1 It 21 31 41 51 

MKF1STSLLL MLLVSSLSPV QGVLEVYYTS LRCRCVQBSS VFIPRRFIDR IQILPRGNGC 60 
PRKEUVWKK NKSIVCVDPQ AEWIQRMMEV LRKRSSSTLP VPVFKRKIP 

Seq ID NO: 89 DNA sequence 

Nucleic Acid Accession ft NM_0O2652 

Coding sequence: 37-477 (tmderifoed sequences correspond to start and stop codons) 
1 11 21 31 41 51 

Li ii If trJiG ATArATTGOC T I CIUI 1 1 1C TCCAGCATGC GCTTGCTCCA GCTCCTOTTC 60 
AGGGCCAGCC CTGCCACCCT GCTCCTGGTT CTCTGCCTGC AGTTGGGGGC CAACAAAGCT 120 
CAGGACAACA CTCGGAAGAT CATAATAAAG AATTTTGACA TTCCCAAGTC AGTACGTCCA 180 
AATGACGAAG TCACTGCAGT GCTTGCAGTr CAAACAGAAT TGAAAGAATG CATGGTGGTT 240 
AAAACTTACC TCATTAGCAG CATCOCTCTA CAAGGTGCAT TTAACTATAA GTATACTGCC 300 
TGCCTATGTO ACGACAATCC AAAAACCTTC TACTGGGACT TTTACACCAA CAGAACTGTG 360 
CAAATTGCAG OOGTOGTTOA TGTTATTOGO GAATTAGGCA TCTGCCCTGA TGATGCTGCT 420 
GTAATCCCCA TCAAAAACAA CCGGTTTTAT ACTATTO AAA TCCTAAAGGT AQA ATAAJGG 480 
AAGCCCTGTC TGTTTGCCAC ACCCAGGTGA TTTCCTCTAA AGAAACTTGG CTGGAATTTC 540 
TGCTGTGGTC TATAAAATAA ACTICTTAAC ATGCTT 

Seqro>K> 90 Protein sequence: 
Protein Accession ft NPJD02643.1 

t 11 21 31 41 51 

MR1JUQLLFRA SPATLLLVLC LQLGANKAQD KTRKHIKNF MPKSVRPND EVTAVLAVQT 60 
ELKECMWKT YUSSIPLQG AFNYKYTACL CDDNPKTFYW DFYTNRTVOJ AAWDVIREL 120 
GICFDDAA VI FDCNNRFYTI HLKV 

ScqIDNO: 91 DNA sequence 
KucIettAddAcccssianft AK000341 

Codineseqocnce: 85-975 (uwVaiM seq^erjces conespond to start mrfs^ 
I 11 21 31 41 51 

GATAGCGCCG GGCAOAGGOA OCCOGCTACC CTGGACAGCG CATCGOCGCC CGCCCGGOTC 60 
GCCGCOCCAC AGCCOCTOCG GATCATGGAA CATCTAAAGG CCTTTGATGA TGAAATCAAT 120 
GCTTTTTTGG ACAATATGTT TGGACCGCGA GATTCTCGAG TCAGAGGGTG GTrCACGTTG 180 
GACTCTTACC TTCCTACCTT 1 1 1 It: 11 ACT QTCATGTATC TGCTCTCAAT A TGGCT GGGT 240 
AACAAGTATA TO AAGAACAG ACCTGCTCTT TCTCTCAGGG GTATCCTCAC CTTGTATAAT 300 
CTTOO AATCA C^CITCTCTC CGCGTACATO CTGGCAGAGC TCATTCTCTC CACTTGGGAA 360 
GGAGGCTACA ACTTACAGTO TCAAGATCTT ACCAOCOCAO GGOAAGCTGA CATC OQOOT A 420 
GCCAAGGTGC TTTGGTGOTA CT ATTTCTCC AAATCAGT AG AGTTCCTGGA CACAATTTTC 480 
TTCGTTTTGC QOAAAAAAAC GAGTCAGATT A C1 1 1 iVl ' AC ATOTATATCA TCATOCTTCT 540 
ATGTTTAACA TCTGGTGGTO TXTTCTTGAAC TGGATACCTT GTGGACAAAG TTTCITTGGA 600 
CCAACACTG A ACAGTTTTGT CCACATTCTT ATGTACTCCT ACTATGGACT TTCTGTGTTT 660 
CCATCTATGC ACAAGTATCT TTGGTGGAAG AAATATCTCA CACAGGCTCA GCTGGTGCAG 720 
TTCOTGCTCA CCATCACGCA CACCATGAGC OOCOTCQTGA AACCGTGTCG CTTC CCCTT C 780 
GGTTGTCTCA TCTTCCAGTC ATCTTATATG CTAACGTTAG TCATCCTCTT CTTAAATTTT 840 
TATGTTCAGA CATACCGAAA AAAGCCAATG AAGAAAGATA TGCAAGAGCC ACCTGCAGGG 900 
AAAGAAGTGA AGAATGGTTT TTCCAAAGCC TACTTCACTG CAGCAAATGG AGTG ATGAAC 960 

386 



WO 02/059377 



AAG AAAGCAC AATAAAAATG AGTAACAGAA AAAOCACATA TACTAGCCTA ACAOATTOOC 1020 
TTGTTTTAAA OCAAAOACTO AATTGAAGGT TACATQTTTT AOGATAAACT AATTTCTTTT 1080 
GAGTTCATAA ATCATTTGTA CCCAG AATGT ATTAATATAT TGCTATTAGG TTAATCTGTT 1140 
AACTGAATGC TTTG ATCAGC ATTGAGGTGA TGCTCACCTC CGAGGACCTC AGAACTGGTG 1200 
CAGdTCTCT CTCCCTCCCT CCCACAGACT OAACCnTCG CCAGAAGCTG TCCTTATAAC 1260 
GCCTTATACG CATACACAGC CAGGAAACGT GGAGCATTGT TTCTCACAGA GAGTCTCCAA 1320 
ATAAAAAGGG TTTTGTTCAG ATTAAAATGT TTACAACAAA ATGTTAATTA TATTCTAAAT 1380 
ACAGGGTATG TTCTAATCTA TATTAAGCAA TAATGCCAGT GCATAATCAT TCCATTTGTT 1440 
CCTTTAGCAA TCAACCCCAG AAAATATTAA AATGGGATCA TACACAGAAG ATAGAAAAAT 1500 
CTAGCAAAAC T tVlVl 1 IC1 OTAAGCCAGA O l Cl IXJlCIA TCAGATTCCC ACAACCACTC 1560 
CTGATTCTAA ATTTAGTGAT ATGGTAATGA AATTGGTATT TATTTTAAAT ATTAGTTATT 1620 
CTAAGGAGAA AAAAATGCTT CTOCAAG ATT TTCATAATTC AGGGGCTGTG GATAGGATTG 1680 
TTCCTCTGTT TCOCTAATCA TTCATCTGTT CATGTCTCCC TCTTGTGOCA GTCAGCCTAG 1740 
GTT ATACAG A TGCCATGCTC CACACCACGA GCAGTGTACA AATCTGGCTG CCCGTTTACT 1800 
TTCTG AGCAA GCACTGGAGT CCACTCCGAC CTTTTTCTTT GAACATGCAT GCTGCTGGAA 1860 
TATGTATAAA TCAGAACTAG CAGAAGTAGC AGAGTGATGG GAGCAAAATA GGCACTGAAT 1920 
TCGTCAACTC TTTTTTGTGA GCCTACTTGT GAATATTACC TCAG ATACCT GTTGTCACTC 1980 
TTCACAGGTT ATTTAAGTTC TTGAAGCTGG GAGGAAAAAG ATGGAGTAGC TTGGAAAGAT 2040 
TCCAGCACTG AGCCGTGAGC CGGTCATGAG CCACGATAAA AAATGCCAGT TTGXjCAAACT 2100 
CAGCACTCCT GTTCCCTGCT CAGGTATATG CGATCTCTAC TGAGAAGCAA GCACAAAAGT 2160 
AGACCAAAGT ATTAATGAGT ATTTOCTTTC TCCATAAGTG CAGGACTGTT ACTCACTACT 2220 
AAACTCTACC AAGAATGGAA AOCAAOAATA TTTTCTGAAG ATTTTTTTGA AGATTAATTT 2280 
ATACCCTATA AAATAAAACT TGTTAGCTTC GATGAAGTCA AAAAAAAAAA AAAAAAAAAA 

Seo ID NO. 92 Protein sequence: 
Protein Accession ft BAA91096.1 

1 11 21 31 41 51 
I 1 1 1 I 1 

MEHLKAFDDB INAFLDNMFG PRDSR VRGWF TLDSYLPTFF LTVMYLLSIW LGNKYMKNRP 60 
ALSLRGILTL YNLCTTLLSA YMLAEULST WEGGYNLQCQ DLTSAGEAPI RVAKY LWWYY 120 
FSKSVEFLDT IFFVLRKKTS QTTFLHVYHH ASMFNIWWCV LNWIPCGQSF FGPTLNSFVH 180 
ILMYSYYGLS VFPSMHKYLW WKKYLTQAQL VQFVLTITHT MSAWKFCGF PFGCUFQSS 240 
YMLTLVHJ1.WYVQTYRKK PMKKDMQEPP AGKBVKNGFS KAYFTAANGV MKKKAQ 

SeqIDNO:93 DN A sequence 

Nucleic Acid Accession #: NM 000044 

Coding sequence; 1115-3874 (underlined sequences correspond to start and stop codons) 

1 II 21 31 41 51 
I I 1 I 1 I 

CGAGATCCCG GGGAGCCAGC TTGCTGGGAG AGCGGGACGG TCCGGAGCAA GCCCACAGGC 60 
AGAGGAGGCG ACAGAGGGAA AAAGGGCCGA GCTAGCCGCT CCAGTGCTGT ACAGGAGCCG 120 
AAGGGACGCA CCAOGCCAGC CCCAGCCCGG CTCCAGCGAC AGCCAACGCC TCTTGCAGCG 180 
CGGCGGCTTC GAAGCCGCCG CCCGGAGCTG CCCTTTCCTC TTCGGTGAAG TTTTTAAAAG 240 
CTGCTAAAGA CTCGGAGGAA GCAAGGAAAG TGCCTGGTAG GACTGACGGC TGCCTTTGTC 300 
CTCCTCCTCT CCACCCCGCC TCCCCCCACC CTOCCTTCCC CCCCICCOCC G ICl'lUiUIC 360 
CCGCAGCTGC CTCAGTCGGC TACTCTCAGC CAACCCCCCT CACCACCCTT CTCCCCACCC 420 
GCCCCOCCGC CCCCGTCGGC CCAGCOCTGC CAGCCCGAGT TTGCAGAGAG GTAACTCCCT 480 
TTGGCTGCGA GCGGGCO AGC TAGCTGCACA TTGCAAAGAA GGCTCTTAGG AGCCAGGOGA 540 
CTGGGGAGCG GCTTCAGCAC TGCAGCCACG ACCCGCCTGG TTAGAATTCC GGCGGAGAGA 600 
ACCCTCTCIT TTOCCCCACT CTCTCTCCAC CTCCTCCTGC CrrCCCCACC CCOAOTOCOO 660 
AGCAGAGATC AAAAOATG AA AAOGCAOTCA GGTCTTCAGT AGCCAAAAAA CAAAACAAAC 720 
AAAAACAAAA AAGCCOAAAT AAAAGAAAAA GATAATAACT CAOTTCT TAT TTGCACCTAC 780 
TTCAGTGGAC ACTGAATTTG GAAGGTGGAG GATTTTGTTT TTTTCTTTTA AGATCTGGGC 840 
ATCrTTTGAA TCTACCCTTC AAGTATTAAG AGACAGACTG TGAGCCTAGC AGGGCA GATC 900 
TTOTCCACCG 'IGTG'IOTUI' 1U1GCACGAG ACTTTGAGGC TGTCAGAGCG CTTTTTGCGT 960 
GGTTGCTCCC GCAAGTTTCC TTCTCTGGAG CTTCCCGCAG GTGGGCAGCT AGCTGCAGCG 1020 
ACTACCGCAT CATCACAGCC TGTTGAACTC TTCTGAGCAA GAGAAGGGGA GGCGGGGTAA 1080 
GGGAAGTAGO TGGAAGATTC AGCCAAGCTC AAGGATGG AA GTGCAGTTAG GGCTGGGAAG 1140 
GGTCTACCCT CGGCCGCCGT CCAAGACCTA CCGAGGAGCT TTCCAGAATC TGTTCCAGAG 1200 
CGTGCGCGAA GTGATOCAG A ACCOGGGCCC CAGGCACCCA GAGGCCGCGA GCGCAGCACC 1260 
TCCCGGCGCC AGTTTGCTGC TGCTGCAGCA GCAGCAGCAG CAGCAGCAGC AGCAGCAGCA 1320 
GCAGCAGCAG CAGCAGCAGC AGCAGCAAGA GACTAGCCOC AGGCAGCAGC AGCAGCAGCA 1380 
GGGTGAGGAT GGTTCTCCCC AAGCCCATCG TAGAGGCCCC ACAGGCTACC TGGTCCTGGA 1440 
TGAGGAACAG CAACCTTCAC AGOCGCAGTC GGCCCTGGAG TOCCACCCCG AGAGAGGTTG 1500 
CGTCCCAGAG CCTGGAGCCG CCGTGGCCGC CAGCAAGGGG CTGCCGCAGC AGCTGCCAGC 1560 
AOCTCCGGAC GAGG ATGACT CAGCTGCCCC ATCCACGTTG TCCCTGCTGG GCCCCACTTT 1620 
CCCCGGCTTA AGCAGCTGCT CCGCTGACCT TAAAG ACATC CTGAGCGAGG CCAGCACCAT 1680 
GCAACTCCTT CAOCAACAGC AGCAGGAAGC AGTATCCGAA GGCAGCAGCA GCGGGAGAGC 1740 
G AGGGAGGCC TCGGGGGCTC CCACTTCCTC CAAGGACAAT TACTTAGGGG GCACTTCGAC 1800 
CATTTCTGAC AACGCCAAGG AGTTGTGTAA GGCAGTGTCG OTGTCCATGG GCCTGGGTGT I860 
GGAGGCGTTG GAGCATCTGA GTCCAGOGGA ACAGCTTCGG GGGGATTGCA TGTACGCCCC 1920 
ACTTTTGGGA GTTCCACCCG CTGTGCGTCC CACTCCTTGT GCCCCATTGG CCGAATGCAA 1980 
AGGTTCTCTG CTAGACGACA GCGCAGGCAA GAGCACTOAA GATACTGCTG AGTATTCCCC 2040 
TTTCAAGGG A GGTTACAOCA AAGGGCTAG A AGGCGAGAGC CTAGGCTGCT CTGGCAG0GC 2100 
TGCAGCAGGG AGCTCCGGGA CACTTGAACT GCCGTCTACC CTGTCTCTCT ACAAGTCCGG 2160 
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AGCACTGOAC GAGGCAGCTG CGTACCAGAG TCGCGACTAC TACAACTTTC CACTGGCTCT 2220 
GGCCGOACXX) CCGCCCCCTC CGCCGCCTCC CCATCCCCAC GCTCGCATCA AGCTGGAGAA 2280 
CCOGCTGGAC TACGGCAGCG OCTGGGCGGC TGCGGCGGCG CAOTGCCGCT ATGGGG ACCT 2340 
GGCGAGCCTG CATGGCGCGG GTGCAGCGGG ACCCGGTTCT GGGTCACCCT CAGCCGCCGC 2400 
rrCCTCATCC TGGCACACTC TCTTCACAGC CGAAGAAGGC CA G 1 1 0 1 ATO GACCGTGTGG 2460 
TGGTGGTGGG GGTGGTGGCG GCGGCGGCGG CGGCGGCGGC GGCGGCGGCG GCGGCGGCGG 2520 
CGGCGGCGGC GAGGCGGGAG CTGTAGCCCC CTAOGOCTAC ACTCGGCCOC CTCAGGGGCT 2580 
GGCGGGCCAG GAAAGCGACT TCACCGCACC TGATGTGTGG TACOCTGGCG GCATGGTGAG 2640 
CAG AGTGCCC TATCCCAGTC CCACITGTGT CAAAAGCGAA ATGGGCCCCT GGATGGATAG 2700 
CTACTCCGGA CCTTACGGGG ACATBCGTTT GGAGACTGCC AGGGACCATG TTTTGOCCAT 2760 
TG ACTATTAC TTTCCACCCC AGAAGACCTG OCTG ATCTGT GGAGATGAAG CTTCTGGGTG 2820 
TCACTATGGA GCTCTCACAT GTGGAAGCTG CAAGGTCTTC TTCAAAAGAG OCGCTGAAGG 2880 
GAAACAGAAG TACCTGTGCG CCAGCAGAAA TG ATTGCACT ATTGATAAAT TCCGAAGGAA 2940 
AAATTGTCCA TCTTGTCGTC TTCGGAAATG TTATG AAOCA GGG ATGACTC TGGGAGCCCG 3000 
GAAGCTGAAG AAACTTGGTA ATCTGAAACT ACAGGAGGAA GGAG AGGCTT CCAGCACCAC 3060 
CAGCCCCACT GAGGAGACAA CCCAGAAGCT GACAGTGTCA CACATTGAAG GCTATGAATG 3120 
TCAGCCCATC TTTCTGAATG TCCTGGAAGC CATTGAGCCA GGTGTAGTGT GTGCTGGACA 3180 
CGACAACAAC CAGCCCGACT CCTTTGCAGC CTTGCTCTCT AGCCTCAATG AACTGGGAGA 3240 
G AG ACAGCTT GTACACGTGG TCAAGTGGGC CAAGGCCTTG CCTGGCTTCC GCAACTTACA 3300 
CGTGGACGAC CAGATGGCTG TCATTCAQTA CTCCTGGATG GGGCTCATGG TGTTTGCCAT 3360 
GGGCTGGCGA TCCTTCACCA ATGTCAACTC CAGGATGCTC TACTTCGCCC CTGATCTGGT 3420 
TTTCAATGAG TACCGCATGC ACAAGTCCCG GATGTACAGC CAGTGTGTCC GAATGAGGCA 3480 
CCTCTCTCAA GAGTTTGGAT GGCTCCAAAT CACCCCCCAG GAATTCCTGT GCATGAAAGC 3540 
ACTGCTACTC TTCAGCATTA TTCCAGTGGA TGGGCTGAAA AATCAAAAAT TCTTTGATGA 3600 
ACTTCGAATG AACTACATCA AGGAACTCGA TCGTATCATT GCATGCAAAA GAAAAAATCC 3660 
CACATOCTGC TCAAGAOGCT TCTACCAGCT CACCAAGCTC CTGGACTCCG TGCAGCCTAT 3720 
TGCGAG AG AG CTGCATCAGT TCACTTTTGA CCTGCTAATC AAGTCACACA TGGTGAGCGT 3780 
GGACTTTCCG GAAATGATGG CAGAGATCAT CTCTGTGCAA GTGCCCAAGA TCCTTTCTGO 3840 
GAAAGTCAAG CCCATCTATT TCCACACCCA GTGAAGCATT GGAAACCCTA TTTOCCCACC 3900 
CCAGCTCATC CCC CC1 1 iVA GATGTCTTCT GCCTGTTATA ACTCTGCACT ACTOCTCTGC 3960 
AGTGCCTTGG GGAATTTCCT CTATTGATGT ACAGTCTGTC ATGAACATGT TCCTGAATTC 4020 
TA1 1JGCIU G G L ' U J JJ ' lll T C 1C1 J JCJC TO L ' l ' l iCJJJ 1 IVl 1VI 1UC CTCCCTATCT 4080 
AACCCTCCCA TGCCACCTTC AGACTTTGCT TCCCATTGTG GCTCCTATCT GTGTTTTGAA 4140 
TGGTGTTGTA TGCCTTTAAA TCTGTGATGA TCCTCATATG GOOCAGTGTC AAGTTGTGCT 4200 
TGTTTACAGC ACTACTCTGT GCCAGCCACA CAAACGTTTA CTTATCTTAT GCCAOGGG AA 4260 
GTTTAGAGAG CTAAGATTAT CTGGGGAAAT CAAAACAAAA AACAAGCAAA CAAAAAAAAA 4320 
A 

Sea ID NO: 94 Protein s eq u ence : 
Froten Accession ft NP000035.I 

1 11 21 31 41 51 
I I I I I I 

MEVQLGLGRV YPRPPSKTYR GAFQNLFQSV RBVIQNPGPR HPEAASAAPP GASLLLLQQQ 60 
QQQQQQQQQQ QQQQQQQQET SPRQQQQQQG EDGSPQAHRR GPTGYLVLDE EQOPSQPQSA 120 
LECHPERGCV PEPGAA V AAS KGLPQQLPAP PDEDDSAAPS TLSLLGPTFP GLSSCSADLK 180 
DILSBASTMQ LLQQQQQEAV SEGSSSGRAR EASGAPTSSK DNYLGGTSTI SDNAKELCKA 240 
VSVSMGLGVE ALEHLSPGEQ LRGDCMYAPL LGVPPAVRPT PCAPLAECKG SLLDDSAGKS 300 
TEDTAEYSPF KGGYTKGLBG ESLGCSGSAA AGSSGTLELP STLSLYKSOA LDEAAAYQSR 360 
DYYNFPLALA GPPPPPPPPH PHARDCLENP LDYGSAWAAA AAQCRYGDLA SLHQAGAAGP 420 
GSGSPSAAAS SSWHTLPTAE EGQLYGPCGG GGGGGGGGGG GGGGGGGGGG GGEAGAVAPY 480 
GYTRPPQGLA GQBSDFTAPD VWYPGGMVSR VPYPSPTCVK SEMGPWMDSY SGFYGDMRLE 540 
TARDHVLPH) YYFPPQKTCL ICGDEASGCH YGALTCGSCK VFFKRAAEGK QKYLCASRND 600 
CTEDKFRRKN CPSCRLRKCY EAGMTLGARK LKKIjGNLKLQ EEGBASSTTS PTEETTQKLT 660 
VSHffiGYECQ PIFLNVLEA1 EPGWCAGHD NNQPDSFAAL LSSLNELGER QLVHWKWAK 720 
ALPGFRNLHV DDQMA VKJYS WMGLMVF AMG WRSFTNVNSR MLYFAPDLVF NEYRMHKSRM 780 
YSQCVRMRHL SQEFGWLQIT PQEFLCMKAL LLFSHPVDG LKNQKFFDEL RMNYKELDR 840 
HACKRKNPT SCSRRFYQLT KLLDSVQP1A RELHQFTFDL UKSHMVSVD FPEMMAEHS 900 
VQVPKILSGK VKPIYFHTQ 

SeqIDNO:95 DN A sequence 

Nucleic Acid Accession^ NM002497 

Coding sequence: 135-1472 (underlined sequences «mrspcnd to start and stop ccdons) 

1 11 21 31 41 51 
I 1 I I t I 

GGCACGAGTA GGGGTGGCGG GTCAGTBCTG CTCGGGGGCT TCTCCATCCA GGTCCCTGGA 60 
GTrCCTGOTCCCTGGAGCTC 03CACTTGGC GCGCAACCTG CGTGAGGCAG (XTCGACTCTG 120 
GCGACTGGCC GGCCATgCCT TCCGGGGCTG AGGACTATGA AGTGTTGTAC ACCATTGGCA 180 
CAGGCTCCTA CGGCCGCTGC CAGAAGATCC GGAGGAAGAG TGATGGCAAG ATATTAGTTT 240 
GOAAAG AACT TGACTATGGC TCCATGACAG AAGCTG AGAA ACAGATGCTT GTTTCTGAAG 300 
TGAATTTGCT TOOTGAACTG AAACATCCAA ACATCGTTCO TTACTATOAT CGGATTATTO 360 
ACCGGACCAA TACAACACTG TACATTGTAA TGGAATATTG TGAAGGAGGG GATCTGGCTA 420 
GTOTAATTAC AAAGGGAACC AAGGAAAGGC AATACTTAGA TGAAGAGTTT GTTCTTCGAG 480 
TGATGACTCA GTTGACTCTG GCCCTGAAGG AATGCCACAG ACGAAGTG AT GGTGGTCATA 540 
CCGTATTGCA TCCGG ATCTT AAACCAGCCA ATGTTTTCCT GGATGGCAAO CAAAACGTCA 600 
AGCTTGG AGA CTTTGGGCTA GCTAGAATAT TAAACCATGA CACGAGTTTT GCAAAAACAT 660 
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TTGTTGGCAC ACCTTATTAC ATGTCTCCTG AACAAATGAA TCGCATGTCC TACAATGAG A 720 
AATCAGATAT CTGGTCATTC GGCTGCTTGC TGTATGAGTT ATOTGCATTA ATGCCTCCAT 780 
TTACAOCTTT TAGCCAOAAA GAACTCGCTG GGAAAATCAG AGAAGGCAAA TTCAGGCGAA 840 
TTCCATACCG TTACTCTGAT GAATTGAATG AAATTATTAC GAGGATGTTA AACTTAAAGG 900 
5 ATTACCATCG ACCTTCTGTT GAAGAAATTC TTGAGAACCC TTTAATAGCA GATTTGGTTG 960 

CAGACGAGCA AAGAAGAAATCTTGAGAGAA GAGGGCGACA ATTAGGAGAG CCAGAAAAAT 1020 
CGCAGGATTC CAGCCCTGTA TTGAGTGAGC TGAAACTGAA GGAAATTCAO TTAC AOOAOC 1 080 
GAOAGCGAGC TCTCAAAGCA AGAGAAGAAA GATTGGAGCA GAAAGAACAG GAGCTTTGTG 1140 
TTCGTGAGAG ACTAGCAGAG GACAAACTGG CTAGAGCAGA AAATCTGTTG AAGA ACTAC A 1200 

10 GCTTOCTAAA GGAACGGAAG TTCCTGTCTC TGGCAAGTAA TCCAGAACTT CTTAATCTTC 1260 
CATCXmCAGT AATTAAGAAG AAAGTTCATT TCAGTGGGGA AAGTAAAGAG AACATCATGA 1320 
GGAGTGAGAA TTCTGAG AGT CAGCTCACAT CTAAGTCCAA GTGCAAGGAC CTGAAGAAAA 1380 
GGCTTCACGC TGCCCAGCTG OGGGCTCAAG CCCTOTCAGA TATTGAGAAA AATTACCAAC 1440 
TGAAAAGCAG ACAGATCCTG GGCATGCGCTAGCCAGGTAa AGAGACACAG AGCTGTGTAC 1500 

1 5 AGGATGTAAT ATTACCAACC TTTAAAGACT GATATTCAAA TGCTGTAGTG TTGAATACTT 1560 
GGCCCCATGA GCCATGCC I T U TO I A TAOT ACACATOATA TTTCGGAATT GOTTTTACTG 1620 
TTCTrCAOCA ACTATTOTAC AAAATQTTCA CATTTAATTT TitTI 11CI1C 111 1AAGAAC 1680 
ATATTATAAA AAGAATACTT TCTTGGTTGG GCTTTTAATC CTGTGTGTGA TTACTAGTAG 1740 
GAACATGAGA TGTGACATTC TAAATCTTGG GAGAAAAAAT AATATTAGGA AAAAAATATT 1800 

20 TATGCAGGAA GAGTAGCACT CACTGAATAG TTTTAAATGA CTGAGTGGTA TGCTTACAAT I860 
TCTCATGTCT AGATTTAAAT TTTAAGTCTO AGATTTTAAA 1U1TI MUAO CTTAGAAAAC 1920 
CCAGTTAGAT GCAATTTGGT CATTAATACC ATGACATCTT GCTTA TAAAT ATTXXATTGC 1980 
TCTGTAGTTC AAATCTGTTA GCTTTGTGAA AATTCATCAC TGTGATGTTT GTATTCTTTT 2040 
JUHIICI G TTTAACAGAA TATGAGCTST CTGTCATTTA CCTACT1C1T TCCCACTAAA 2100 

25 TAAAAG AATT CTTCAGTTA 

Sen ID NO- 96 Protein sequence 
Protein Accessioa #: NP002488 

30 1 11 21 31 41 51 

GIREFNPNIM ANEVERINM1 TOSISGENEA RBLATEDKIN ASEHOMOSAP IENSMPSRAE 60 

DYBVLYTKJT gsygrcqkir rksdgkilvw KELDYGSMTE aekqmlvsev NLLRELKHPN 120 
IVRYYDRIID RTNTTLYIVM EYCEGGDLAS VTTKGTKERQ YLDEEFVLRV MTQLTLALKB 180 

35 CHRRSDGGHT VLHRDLKPAN VFLDGKQNVK LGDFGLARIL NHDTSF AKTF VGTPYYMSPB 240 
QMNRMSYNEK SDIWSLGCLL YELCALMPFF TAFSQKELAG KIRBGKFRRI pVRYSQELNB 300 
UTRMLNLKD YHRPSVEHLENPUADLVA DBQRRNLERR GRQLGEPEKS QDSSPVLSEL 360 
KLKEIQLQER ERAJLKAREHR LEQKBQELCV RERLAEDKLA RAENLLKKYS IXKBRKFLSL 420 
ASNPELLNLP SSVKKKVHF SGRSKENIMR SENSESQLTS KSKCKDLKKR LHAAQLRAQA 480 

40 LSDIEKNYQL KSRQUjGMR 

Seq ID NO: 97 DNA sequence 

Nucleic Add Accession NM_007050l2 

Coding sequence: IB5-4576 (underiined sequences correspond to start and stop codons) 

45 

1 11 21 31 41 51 

OCTOCCX50CT CAGTTCGCGC CGOGCCTCGG CTTGGAACGC AGGAGGGOOG GCTCCGGGAG 60 
CCCGAGCGGA GCCAGCCGCG CGCACAGCCA GCGGCCGCGC CGGCGATGCG GGGOCACCOC 120 

50 GCGCCCGCCCCAGTCCCGGCCCCGGCCC^ 180 
COQ GATGG CG AGOCTCGOCG CGCTCGCCCT CAGCLTGCfC CTOAGGC TQC AGCTOCCGCC 240 
ACTGCCCGGC GCCCGGGCTC AGAGCGCCCC AGGTGGCTGrT TCCTTTGATG AGCACTACAG 300 
CAACTGTGGT TATAGTGTGG CTCTAGGGAC CAATGGGTTC ACCTGGQAGC AGATTAACAC 360 
AACGGAGAAA CCAATGCIGG ACCAGGCAGT GOOCACAGGA TCTTTCATGA TGGTGAACAG 420 

55 CTCTOGOAGA GCCTCTGGCC AGAAGGCCCA CLT1C1CCTG CCAACCCTGA AGGAGAATGA 480 
CAOCCACTGC ATOG ACTTCC ATTACTACTr CTCCAGCCGT GACAGGTOCA GCCCAGGGGC 540 
CTTGAACGTC TAOGTGAAGG TGAATGGTGG COCOCAAGGG AACCCTGTGT GGAATGTGTC 600 
CGGGGTCGTC ACTGAGGGCT GGGTGAAGGC AGAGCTCGCC ATCAGCACTT TCTGGOCACA 660 
TTTCTATCAO OTGATATTTG AATCCOTCTC ATTGAAGGGT CATCCT GGCT A CATOGCCGT 720 

60 GGACGAGGTC CGGGTOCTTG CTCATCCATG CAGAAAAGCA CXnCATTTTC TGCGACTCCA 780 
AAACGTGGAG GTGAATGTGG GGCAGAATGC CACATTTCAG TGCATTOCTG GTGGGAAGTG 840 
GTCTCAGCAT GACAAGCTTT GGCTCCAGCA ATGGAATGGC AGGGACAOGG GOCTGATGGT 900 
CACOOGTGTG GTCAACCACA GGOGCTTCTC AGCCACAGTC AGTGTGGCAG ACACTGCOCA 960 
GCGGAGOGTC AGCAAGTACC OCTGTGTGAT CCGCTCTGAT Q GTGGG TCTQ QTGTOTCCAA 1020 

65 CTACGCGOAG CTGATCGTGA AAGAGCCTCC CAOGCCCATT GCTCCOCCAG AGCTGCTGGC 1080 
TGTOGGGGCC ACATACCTGT GGATCAAGOC AAATGCCAAC TCCATCATCG GGGATGGCOC 1 140 
CATCATCCTG AAGGAAGTGG AATATCGCAC CAOCACAGGC AOGTOOGCAG AGACOCACAT 1200 
AGTCG ACTCT CCCAACTATA AGCTGTGGCA TCTGGAOCCC GATGTTGAGT ATGAGATCOG 1260 
AGTGCTOCTC ACAOGACCAG GTGAGGGOGO TACGGGACCG CCAGGGGCTC COCICAOCAC 1320 

70 CAGGAOCAAG TGTGCAGATC CGGTACATGG CCCACAGAAC GTOG AAATCG TAGACATCAG 1380 
AGCOCGGCAG CTGACCCTGC AGTGGG AGCC CTTCGGCTAC GOGGTGACCC GCTGCCATAG 1440 
CTACAACCTC ACCGTGCAGT ACCAGTATGT GTTCAACCAG CA GCAGTAC G AGGCCGAGGA 1500 
GGTCATCCAG ACCTOCTCCC ACTACACOCT GCGAGGCCTG CXHXCCTICA TGACCATCCG 1560 
GCTGCGACTC TTGCTGTCTA ACCCCGAGGG CCGAATGGAG AGCGAGGAGC TGGTGGTGCA 1620 

75 GACTGAGGAA GACGTTCCAG GAGCTGTTCC TCTAGAATCC ATCCAAGGGG GGCCCTTTGA 1680 
GGAGAAG ATC TACATCCAGT GGAAAOCTCC CAATGAGACC AATGGGGTCA TCACGCTCTA 1740 
CGAGATCAAC TACAAGGCTG TCGGCTCGCT GGACCCAAGT GCTGACCTCT CGAGOCAGAG 1800 
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GGGG AAAGTG TTCAAGCTOC GGAATGAAAC CCAOCAOCTC TTTGTGGGTC TGTACCCAGG 1860 
GACCACCTAT TCCTTCACCA TCAAGGCCAG CACAGCAAAG GGCTTTGGGC CCCCTGTCAC 1920 
CACTCGGATT GCCACCAAAA TTTCAGCTCC ATOCATGCCT GAGTAOG ACA CAGACACCCC 1980 
ATTGAATGAG ACAGACACGA CCATCACAGT GATGCTGAAA CCCGCTCAGT CCCGGGGAGC 2040 
5 TCCTGTCAGT GTTTATCAGC TGGTTGTCAA GGAGGAGCGA CTTCAGAAGT CACGGAGGGC 2100 
AGCTGACATT ATTG AGTGCT TTTCGGTGCC CGTGAGCTAT CGGAATGCCT CCAGCCTCGA 2160 
TTCTCTACAC TACTTTGCTG CTGAGTTGAA GCCTGCCAAC CTGCCTGTCA CCCAGCCATT 2220 
TACAGTGGGT GACAATAAGA CATACAATOO CTACTGGAAC CCICCTCTCT CTCCCCTGAA 2280 
AAGCTACAGC ATCTACITCC AGGCACTCAG CAAAGOCAAT GG AGAGACCA AAATCAACTG 2340 

10 TGTTCGTCTG GCTACAAAAG CACCAATGGG CAGCGOCCAG GTG ACCCCGG GGACTCCACT 24C0 
CTGCCTCCTC ACCACAGGTG CCTCCACCCA GAATTCTAAC ACTGTGGAGC CAGAGAAGCA 2460 
GGTGGACAAC ACCGTGAAGA TGGCTGGOGT GATCGCTGGC CTCCTCATGT TCATCATCAT 2520 
TCTCCTGGGC GTGATGCTCA CCATCAAAAG GAGAAGAAAT GCTTATTCCT ACTCCTATTA 2580 
CTTGTCOCAA AGGAAGCTGG CCAAGAAGCA GAAGGAGAOC QAGAGTGGAG OCCAGAGGGA 2640 

15 GATGGGGCCT GTGGCCTCTG CCGACAAAOC CACCACCAAG CTCAGCGCCA GCCGCAATGA 2700 
TGAAGGCTTC TCTTCTAGTT CTCAGGACGT CAACGGATTC ACAGATGGCA GCCGCGGGGA 2760 
GCTTTCCCAG CCCACCCTCA COATCCAGAC TCATCCCTAC CGCACCTGTG ACCCTGTGGA 2820 
GATGAGCTAC CCCCGGOACC AGTTCCAACT COCCATCOGG GTGGCTGACT TGCTGCAGCA 2880 
CATCACGCAG ATGAAGAGAG GCCAGGGCTA CGGGTTCAAG GAGGAATACG AGGCCTTACC 2940 

20 AGAGGGGCAG ACAGCTTCGT GGGACACAGC CAAGGAGGAT GAAAACCGCA ATAAGAATCG 3000 
ATATGGGAAC ATCATATCCT ACGACCATTC CCGGGTGAGG CTGCTGGTGC TGGATGGAG A 3060 
CCCOCACTCT GACTACATCA ATGCCAACTA CATTOACGGA TACCA TCOAC CTCGGCACTA 3120 
CATTGCG ACT CAAGGTOCG A TGCAGGAGAC TGTAAAGGAC TTTTGGAGAA TGATCTGGCA 3180 
GGAGAACTCC GCCAGCATCG TCATGGTCAC AAAOCTGGTG GAAGTGGGCA GGGTGAAATG 3240 

25 TGTGCGATAC TGGCCAGATG ACACGGAGGT CTACGGAGAC ATTAAAGTCA CCCTGATTGA 3300 
AACAGAGCCC CTGGCAGAAT ACGTCATACG CACCTTCACA GTOCAGAAGA AAGGCTACCA 3360 
TGAG ATCCGG GAGCTCCGCC TCTTCCACTT CACCAGCTGG CCTG ACCACG GCGTTOCCTG 3420 
CTATGCCACT GGCCTTCTGG GCTTCGTCCG CCAGGTCAAG TTCCTCAACC CCCOGGAAGC 3480 
TGGGCOCATA GTGGTCCACT GCAGTGCTGG GGCTGGGCGG ACTGGCTGCT TCATTGOCAT 3540 

30 TGACACCATG CTTGACATGG CCGAGAATGA AGGGGTGGTG GACATCTTCA ACTGCGTGCG 3600 
TG AGCTCOGG GCCCAAAGGG TCAAOCTGGT ACAGACAO AO GAGCAATATG TGTTTGTGCA 3660 
CGATGCCATC CTGG AAGCGT GCCTCTGTGG CAACACTGCC ATCCCTGTGT GTGAGTTCCG 3720 
TTCTCTCTAC TACAATATCA GCAGOCTGOA COCCCAGACA AACTOCAGCC AAATCAAAGA 3780 
TGAATTTCAG ACCCTCAACA TTGTGACACC OCGTGTGCGG CCCGAGGACT GCAGCATTGG 3840 

35 GCTCCTGCCC GGGAAOCATG ATAAGAATCG AAGTATGGAC GTGCTGCCTC TGGACCGCTG 3900 
CCTOCCCTTC CTTATCTCAG TGGACGGAGA ATCCAGCAAT TACATCAACG CAGCACTGAT 3960 
GGATAGCCAC AAGCAGCCTG CCGCCTTCGT GGTCACCCAG CACCCTCTAC CCAACACCGT 4020 
GGCAGACTTC TGGAGGCTGG TG I 1CGATTA CAACTGCTCC TCTGTGGTGA TGCTGAATGA 4080 
GATGGACACT GCCCAOTTCT GTATGCAGTA CTGGCCTGAG AAGACCTXXG GGTGCTATGG 4140 

40 GCCCATCCAG GTGGAGTTCG TCTCCGCAGA CATCGACGAG GACATCATCC ACAGAATATT 4200 
CCGCATCTGT AACATGGCCC GGCCACAGGA TGGTTATCGT ATAGTCCAGC ACCTXXAGTA 4260 
CATTGGCTGG CCTGCCTACC GGGACACGCC CCOCTCCAAG COCTCTCTGC TCAAAGTGGT 4320 
CCGACGACTG GAOAAGTGGC AGG AGCAOTA TGACGGGAGG GAGGGACGTA CTGTGGTCCA 4380 
CTGCCTAAAT GGGGGAGGCC GTAGTGGAAC CTTCTGTGCC ATCTGCAGTG TGTGTGAGAT 4440 

45 GATCCAGCAG CAAAACATCA TTGACGTGTT CCACATCGTG AAAACACTGC GTAACAACAA 4500 
ATOCAACATG GTGGAG AOCC TGGAACAGTA TAAATTTGTA TACG AGGTGG CACTGGAATA 4560 
TTTAAGCTCC TTTTAGCTCA ATGGGATGGG GAACTGCCGG AGTCCAGAGG CTGCTGTGAC 4620 
CAAGCCCCCT TTTGTGTGAA TGGCAGTAAC TGGGCTCAGG AGCTCTGAGG TG GCACCCTG 4680 
CCTGACTCCA AGG AGAAGAC TGGTGGCCCT GTGTTCCACG GGGGGCTCTG CACCTTCTGA 4740 

50 GGGGTCTCCT GTTOCCGTGG GAGATGCTGC TCCAAAAGGC CCACKJCTTCC TTTTCAACCT 4800 
AACCAGCCAC AGCCAAGGGC GCAAGCAGAA GTACACCCAC AAGCAAGGCC TTGGATTTCT 4860 
GGCTCCCAGA CCACCTGCTT TTGTTCTGAG TTTGTGGATC TCTTGGCAAG CCAACTGTGC 4920 
AGGTGCTGGG GAGTGGGAGG CTCCCCTGCC CTCCTTCTCC TTAOQAGTGG AOOAOATOTO 4980 
fOTTCTGCTC CTCTAGGTCA TGOAAAAGAT TGAGGCTCTT GOGGGTCACT GCrcrGCTGC 5040 

55 CCCCTGCAAC CTCCTTCAGG GGCCTCTGGC ACCAGACATT TGCAGTCTGG ACCAGTGTGA 51C0 
CCTTACGATG TTCCCTAGGC CACAAGAGAG GCCOCCCATC CTCACACCTA ACCTGCATGG 5160 
GGCTTOGCOC ACAAGCATTC TGTAOCOCTT COCCAGCCTG GGCCTTGACC GTOCAGCATT 5220 
CACTGGCCGG CCAGCTGTGT CCACAGCAGT TTTTG ATAAA G G ' IGTiCllT G ClI AlllGf 5280 
GTGGTCAGTG GGAGGGGGTG GAACTGCAGG GAACTTCTCT GCTCCTCCTT GTCTTTGTAA 5340 

60 AAAGGGACCA CCTCCCTGGG GCAGGGCTTG GGCTGACCTG TAGGATGTAA CCCCTOTOTT 5400 
TCTTTGGTGG TAGCTTTCTT TGGAAGAGAC AAACAAGATA AGATTTG ATT ATTTTCCAAA 5460 
GTGTATGTGA AAAGAAACTT TCTTTTGGAG GGTGTAAAAT CTTAGTCTCT TATGTCAAAA 5520 
AGAAGGGGGC GGGGGAGTTT GAGTATGTAC CTCTAAGACA AATCTCTCGG GCCTTTTATT 5580 
TTTTCCTGGC AATGTCCTTA AAAGCTCCCA OCCTGGGACA GCATGCCACT GAGCAAGGAG 5640 

65 AGATGGGTGA GCCTGAAGAT GGTCCCTTTG GTTTCTGGGG CAAATAGAGC ACCAGCTTTG 5700 
TGCATAATTT GGATGTCCAA ATTTO A ACTC CTTCCTAAAG AAACCCAGCA GCCACCTTOA 5760 
AAAAGGCCAT TGTGGAGCCC ATTATACTTT GATTTAAAAT AGGCCAAGAG AATCAGGCCT 5820 
GGAGATCTAG GGTCTTGTOC AAAGTGTGAG TGAGTCAATG AGAGGGAAOC AACATTTGCT 5880 
AAGTCTCTAC TGTATGCCAG GG ATCATGCT TGGCACTTTC CATAGGACAT TTCACACAGT 5940 

70 CCTTAGAACC COCAGG AGAG AGCTACTGAC TTQTTATCAT CTCCATTTG A TCATCTCCTC 6000 
CAATOAGOAA ACOCACGCAC CTTCCTTAGT AATGAAATCC TGGGTTCCAA AGGGGCAGGT 6060 
AATGGCAATG AGACTTCTCC GTGCTGTTTT CTTCATCTTC TCTAAGCCAA GCAATTATTT 6120 
TATGG AGGGA AAATAAGGCC AGAAACTTCT GAGCAGATAA CTCCACAAAT GGAAATTTAG 6180 
TACTTTCTTC CTGATGGCAG TTCTTCTGGG AAGCGCAGAA TTTCAGATAT ATTTTAGTAA 6240 

75 CACATTCCCA GCTCCCCAGG AAAGCCAGTC TCATCTAATT TCTTAGTCAG TAAAAACAAT 6300 
TOOCTGTTCC TTCAGGCTAT GAATGG ACCA GCCAGGG AAA CTCTOG ACCT TGATCTCTAG 6360 
(XAGTGCTTA GGCCCAATAT CTOACAGCCT CAGGTGGGCT GGGACCTAGG AAGCTCCATC 6420 
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TTGAAGGCTG OTCTAGCCCC AOACAGGGCA TGAGGOOCAO AQAATTCAAG AAGGTACAGC 6480 
TTTGGCCCTC AAGAGCCCAC TGTATGCTGG GGAAATGG AA CCATGGTGCA GTAGTGTGGA 6540 
GTGGATGAGT GTTCCATGAG CCTAGGAGCA AGAAAGTCTC TTCXX5CCTCG GGCTTOCTGG 6600 
AGAAGGGGAC GTCCATTCCT GCTGGGTCTT AACAAGCATA AAAAGGAAAA AAAGGAAACT 6660 
CAGGCAAAGG GATCCATATG TGCAATGGCA AAGAAATGTG AAAAGGCATT GGGAGAAGCA 6720 
GTCTGGGGGA GGCCAGCCCA GTGCGGGCAC AGCACAACAC GGGGAOCAGC AAGAGATGAG 6780 
CCAGGGTCCA GGAGACAGAT GCCCATCXjCG AGTACAGACT TTGTCCTATT GGCAACAAGG 6840 
AGTCCATGGA GCTTTAGAGA GATGCACTCA GCTTCGTGTT GGCCAAGACT CCTTCTGGGC 6900 
CAATGGGGCT GCCTCTTTTC CTTTCATCAG ACACTGTGAA AACATTCCCT TAAGCGTGCA 6960 
CI I II 1A ATA TCACATCTAT TWlTr n nCl GCICA'UU 17 1 IGTIXSCTOO AACTAAATAT 7020 
GCAATGGATC ATGAGACTCA GATTCTATGA GAAACCCAGG GTCTCTGCTT TACCACGGAG 7080 
CAGGGTCACC AACCCAGA7C TOCAGGOOCA TGAGGATGGA ACATGAAAGG AGGCGACAAA 7140 
AGT1GCTTOC ATTGGCATGG GCTCTGGAGC TGTCCAGAAG TCCAGGGACA CCAGACTTGA 7200 
TCAAGGAAGG GCTGTCACTT TAG AGGTTCA AAAGGAAGTG CCTCAAAGCA AAGGCAAGCA 7260 
AAGGAACCCC ACGATGAACT TGCTCTTTTC CTTTGATGAG CCTCTCCCCA GGTGTATTTC 7320 
AGCAGACCCC GGGG AOCCAC CCCCACTGGG CCTGCTGGCC TCCCTCGGCT CCAGCCCAAT 7380 
GCCCCAGCTG GCCTTCCCCA GCCTGCAAGG AGCCTGTAGC ATGGCAAATC TGCCTGCTGT 7440 
ATGCTATTTT CTTAGATCTT GGTACATCCA GACAGG ATGA GGGTGGAGGG AGAGCTATTT 7500 
AACACAAATC CTAAGATTTT TTTCTGCTCA GGAAGGGGTG AAATAGCTGG CAGATACAAA 7560 
AG ACAGTGGC TTTTATCATT TTAAATGGTA GGAATTTAAG GTGTGACTTC AGGGAGAAAC 7620 
AAACTTGCAA AAAAAAAAAA TCTCAGGCCA TGTTGGGGTA ACOCAGCAAG GGCCAGTGAT 7680 
GATTTCCCCC AGCTCATCCC CTTATTTTCC CACAACCCAA CCATTOCTA AAGCAGGACA 7740 
GTGAATAGGT CTTAGGCCAG TGCACACAGG AAGAAATTGA GGCTTATGGA TGGGGATGAC 7800 
TTCOCTAAG A TCCCATGGG A CAAGG ATGTG GCAAGGCTTG GATGAG ATGG GGCAOCAGTG 7860 
CCCAGGAATT TGAACATTTT CCTTTACCCA GGAAATCTCC GGAGCCAACA CCACCACCCC 7920 
CAGGGGGTCT CCCCACCCCA OCOCATTTAC AGGGTGAGCT CAGCCTGTCA TGAGCAGAGG 7980 
AAAATATTAT TAATGCTCTC TGAGTCTTTA CAACAGGAGC TCTTACCTCA TAGATGTGGG 8040 
CTCTGTTTGG GGAAGATGCA AGG AAGTAAT GAGAAGCOCA GGAAATTTCT CCACCTGTGT 8100 
TTATGGCCTA AATAGCTTCA GGATGTATCT TAGCTGCACT CCAACATTGC ATCCTTTCTG 8160* 
GGGTGAAGAA TCTGGGCCAA CCAGGGGTCC TTGGGCCTCT AGAAGGCCAC AGTAGGCCTC 8220 
TCTTTGTGGG AATGGAAGGG G ACAGTTTGC TTTTAGTGCT GGCCCTCTCT GTGGGTGTGG 8280 
CCTGCAAAGG AACCAACAGA OOCTATGCTG GGGACTCTAA CATGTGAGCT CATTAAATTC 8340 
TTCCAGCATT CTAAAGGAGG GTTTGTGATT GTCACCATTT ACTGATGAGG AAACTAAGGC 8400 
TCCTAGGGGA GAAATCACTT GCCCACAGTT CCACAGCTAG TGAGTGAATG AACCAGGATT 8460 
TAAACCGGTT TTTTCTCACT ACAG AG ACAA TATTTTTCCA CCATTGTATC TCACATTTTT 8520 
CCCAGGAGGT TACCCATAAC AGAAG AGACT AGAGTGGAAC AGATACGTCA GTGGATAAAG 8580 
CTCAAAGCAA ACAACAGTAA GCTTAAAATT CCTTCATAGT CTCATGTTTT AOGTTCACAA 8640 
TTCATGCAAA ATTTGCATTC CACTTTCTGA TTTAGCCTTG TTGGTTTTAA TATGACTCTA 8700 
TGAATATTTC AAAAAAAAATGTGCrCTCl 1 CCTCATGTTG 1 1L1G1 TTCACCCCGC 8760 
TATGACGGAC CCTAGGTCAG CTGGTCTTCA GCTTG ACCCT AGAATTGACT CTAGGAGCAO 8820 
TGACCCTGCT GCCTCCCAGA GCCAGTTATA GGCTCAAGAT CAAGACCAAC TGACCTTCTC 8880 
CTAGGCAGCT CCTTTGGTGT GTGGGTGCTC TGACCTCACT GTTCATGAGG GGACCTCAAC 8940 
TAAGGCATCT TCCAGTTGGG TGCTGGAAGG AACCCATTAA CTCACACTAG AATGATGAGG 9000 
ATTTGCTCAT CTGGCGTGGA GAAGGATGAG COCACAAAAC CCTAAAGGGA AAAGAGAAGC 9060 
TGGACACAGC TGTACTCAGC AGATTCCTGA ATGCTAGGCT GGAAAGTGGT GCCTGTTGTC 9120 
CAAGTGGAGT CACATGGTTG CTAATGTGGG CAAGTCTGAG GACACACTTC ATGAGCAGCT 9180 
GGGGTCTGG A AGGCTCCTCA CTTTAOOCTA GCCACACATA ATTACTGGGT GCCTACAGCA 9240 
OCTAGCACCT TGGAGGGGGC ACTATTAGGA AATCGAGATT ACTATGGCAC AATTAATTOC 9300 
TGGGTAAGGC ATGGGGTTGT GGTGGACAG A GCTCAGTCTT TAGTTTGAAC GAAAACATAC 9360 
ATACATGAAA AACATACATG AAAAAAGG AC CCTCATCAAC ATTAGAAGGG GTAGATTTGG 9420 
AGCACTTTAG GCAGG AAAAC AGGAACGCAA GGCCAGGAAA CTGGAACCCA GTGAATACTC 9480 
AGAACCG AGG ATGCAGATG A CTTATTTAGC AAAATGGTCA CTTCTGTGAC ATAGCTGG AG 9540 
AAAGGATGGG TAACAGCTTG CCAGAGCCAC TTGGAACAAG GGCAAATCTC AGTGTCTGGG 9600 
GCAAAAGATG ATGCATTTCC CTCTGAOCCA TCATGTTTAT TCATCCTCCA CTCCCCATTG 9660 
CCACACTAGC TCTTGCTOTA AGTCCTCACC AGG ATCTACA TTTTXTCGTC GCTGGTGGGA 9720 
ACCCCTTAGA GTACATAGAG GTATCAGTCC AGTAAGACTG CTCTACACAA CAGAAGTGAG 9780 
GCCCAGGGAG TAGCAGCCAG GCCCTTATOC TGTTAOCTCT GCAGG AGTGA CTGCCCAACC 9840 
CAGATCCAGA GACATTGAAG GAAATGATAA TTCCTTGGTA CCTCACTGCC TTGGGACAAA 9900 
ATGAAQAAAG CCACCCTTCC TTAGGCTGCA GCTTGCCACT CCTGGGCTGG GTAAACAGGT 9960 
CATCAGCACC AGGCTCAACC AGGAGTAACA TTCTGGAAGA CATGGGTGAG CCCAAGAGGA 10020 
AGCATGAACA GGACGCTGTT CCTAAGTCAT GTCAACAGGT TGTGCTGGGC CAGGATCCCC 10080 
AGGGAAAAAA ATGGTCAACC CAACTGGAGG GTAGGTTAGA AGAAAAAAAA CATAAACGTG 10140 
GATAGTCATG TCATCTCAAA TCCCTGACTT GGCTTCCCCA TTACTTGACA GTCTGAGCTC 10200 
CnCTTAGCC TGTGACCAGC TTCAAATCAC AGCCAAGTAA AACAAGGAAA TAGGAAAAGT 10260 
AAATCCAACT AG AAGAGACA AGCTGAGATT CAGATTTGTT TACTCCTCCC ATGCAAAOTT 1 0320 
TCCCTGTTGG AGGTTTTCCA TGTATACATG TCTAGAAGTG ATAGAATGCA AGGCCTTGGC 10380 
TTTGTCTTGC AGGGATCTGC CTTTGAGGTC ATAGACTGAA CAGCAGGGAG AGAGGTTAGT 10440 
GGTGGAGTGT GGGGGGAGCT GTTCTAGCTC CAGTTTCTTC TGACACATTT TTCAGGATCA 10509 
TGQATCTO AT CCTCCGAAGC ACAGCAGAGA TATCTAAGCC ATATTTGTGC ACATGAGCAG 10560 
ACTCTTCTAG TTTTTTAGTA ACCAGGGATO GGCTTTTGCA TOGCACTGAC TATAGAGATG 10620 
TCTTGTAGAG ATCAAGCCAG TCTTTTGCAT CCCACCTGCC CACCTOCAGA AGAGATGGGA 1O680 
AAAGGTCATC AAAGGGCATT CACCAACTGA AATCCACTCA TGAATGTTAG GTCTCTAAAA 10740 
GGAGGCATCA ACACTCACAA TGGTAGCCTC CAAACCTAGC ATCCCACCTA TCTAAGAGCT 10800 
CAGGGGTCGT CCACTGGGGC AG ATACAAGG GAAGTGCAAG GGCTCAGGAT GAAAGAAAAT 10860 
CTATTGGGAA GAGTTTTAGG GGCTTGATCA TTATGGGGCT TCCTTCTATA TCTGAGAACT 10920 
GCTCTGGGTG GTGAGATGTG GACTCTGATC CTTAATTGGA ATGTTCGGAG AATGAGTGTC 10980 
TGGTGGCCTT GAAGTGTTGG ACAGAAAAGT ATCAGTATAA AAGCCTGGAG CTCAGGGTAA 1 1040 
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TTAATGTAGT TCATGGTTCC TTAGTGAGCA GOACTCTTCC ATGTGGAGGA GAAAGGGTCA 1 1 100 
TAGGAAGTAA ACCACXAAAA TTACAAAATT GAGTCTCTGT ACAATTACTT CAGTGCCTTT 1 1 160 
GGGCTTATGA ATACAAATCA GTGGGOCTTC TCTATGATOO TOCAACAAAC TCTCAGTGTC 1 1220 
CACOCTCTCC CTGTATCTCC CATGGAAGAT GAATAATGTC AGGTGTTCTT TGGGTCAAAG 1 1280 
GOCCCAGGGC AGTCTGGAGG CTTAGAGGGC AGAGTGGTGT CATTCCATGT AAAOTTAGGC 1 1340 
TTCTGAGGGG TCAGGCAGAA TATGGTGTCC ATATCTTCCA TAOCTCTGCA GATTCTTGGA 1 1400 
TGAAGTCAAG CACAGTTTGC TAGACCCAGG TGACTOCTCT GAGTATAACT AGGACCCATG 1 1460 
AGTGAAACTT AATAGCTGTA AGGAAGAACC TGCTGTCTGC CAGAGAGGAT AAGCTGCCCA 1 1520 
TCTCAGCAGC TGTCTAAAAG AAGGCAGGTG TCTCTTTAAA GGGAAGAGAA GCATTGGTGA 1 1580 
AATGGATTTC AGGTCACTTC CATTCCAGAT GGGTGAGATC TTGTGGAGCT GGGATCATGT 1 1 640 
TTGAACTCAT TCATACCTGT AGAGCACGAA TCCAAGTAGA TTGTGTTTGG TCTGTACAGG 1 1700 
CTGAAGCCOC CTGCTCTOCC ACCCAAGTGC CCCCACTGAG CAGGCCAACA TGCTGTTGTG 1 1760 
GCCACATATA CTGGGCTGAT OCAGGCTGGT TATCACCAAA CAGCAAACCA TAGGGAACAG 11820 
CTGCTTTGOC ATAGACCCAA TAOOCATGTA GATCTCTCAT GAGAGCAGCC ATAACTCAGA 11880 
CCCACTGACC AACAGGGCCA TGAGTGACAG CCAGAACCAG TGAAGGTCCA AGTAGGACAC 11940 
AGAGCAGGGC TTTTCTTACC ATACACATTA TCTCCAGAGG TTATTTCTAC CCCACTCCCT 12000 
ATTCAAGGCC TGTTGGAGCA CACTGCAAAA GCAAAAGCAC AGTAACICAA TTTACACATG 12060 
ATTATAATCA TTTCCAGTGC ACACATTTCA TCAOCAGGTG GATCCTGAGC TAGOCCATGT 12120 
AAATOCGGGT TAACOCATAT TGGTAATCAT ACTCAAAAGC AC1 1 1 1CACC CTACATTCTA 12180 
CTAGCCAATC AAAGACAAAG AGTTGTGGCC TCTACCATTG CXTTGGCITC 7GGACACCCT 12240 
CACAAGCTAT (XCAAGGTTC COGCTCAACT OCAGGGAGGC TGACATCTTC ACATCCACTG 12300 
GGCATATAAT ATTGCATGAG ACCAAAGTCT CCACACTCIT TGCAGCCTCC TCCATGAATC 12360 
OCAATGGCCT GCACTTGTAC AGTTTGGGTG TTTGATAGAT AAAGCACGTA TGAGAAGAGA 12420 
AAACAAAATA AATCAACTTT TTAAAAAAGC CAGCACTGTG CTGTCAATGT 1 I'l l 1 1 1 VIC 12480 
TTTTCAATTC TAGCTCAGAA AAGCAGAAGG TAAATAATGT CAGGTCAATG AATATCAGAT 12540 
ATATTTTTTG ACTGTACATT ACAGTGAAGT GTAATCTTTT TACAOCTGCA AGTOCATCTT 12600 
ATTTATTCTT GTAAATGTTC CCTGACAATG TTTGTAATAT GGCTGTGTTA AAAAATCTAT 12660 
ACAATAAAGC TGTO ACOCTO 

Scq ID NO: 98 Protein sequence: 
Protein Accession ft NPJ00898IJ 

I 11 21 31 41 51 
1 f I I t 1 

GIREFNPPRO TE1NTYROS1 NEPHOSPHAT ASERECEPTO RTYPETHOMO SAPIHN SMAS 60 
LAALALSLLL RLQLPPLPGA RAQSAPGGCS FDEHYSNCGY SVALGTNGFT WEQINTTEKP 120 
MLDQAVPTGS PMMVNSSGRA 9GQKAHLULP TLKEMDTHCI DFHYYFSSRD RSSPG ALNVY 180 
VKVNGGPQGN PVWNVSGWT EGWVKAELAI STFWPHFYQV IFESVSLKGH PGYIAVDEVR 240 
VLAHPCRKAP HF1JUUQNVEV NVGQNATFQC IAGGKWSQHD KLWLQQWNGR DTALMVTRW 300 
NHRRFSATVS YADTAQRSVS KYRCVIRSDG GSGVSNYAEL IVKEPPTP1A PPELLAVGAT 360 
YLWKPNANS IIGDGPtILK EVEYRTTTGT W AETOTVDSP NYKLWHLDPD VEYHRVLLT 420 
RPGEGGTGPP GAPLTTRTKC ADPVHGPQNV ETVDIRARQL TLQWEPFOYA VTRCHSYNLT 480 
VQYQYVFNQQ QYEAEEVIQT SSHYTLRGLR PFMTIRLRLL LSNPEGRMES EELWQTEED 540 
VPG A VPLESI QGGPFEEKIY IQWKPPNETN G VTTLYEINY KA VGSLDPSA DLSSQRGKVF 600 
KLRNETHHLF VGLYPGTTYS FTKASTAKG FGPPVTTR1A TKISAPSMPE YDTDTPLNET 660 
OTITrVMLKP AQSRGAPVSV YQLWKBERL QKSRRAADH ECPSVPVSYR NASSLDSI -HY 720 
FAAELKPANL PVTQPPTVGD NKTYNGYWNP PLSPLKSYSI YPQALSKANG ETKINCVRLA 780 
1KAPMGSAQV TPGTPLCXXT TGASTONSNT VEPEKQVDNT VKMAGVIAGL LMFHILLGV 840 
MLTIKRRRNA YSYSYYLSQR KLAKKOKBTQ SGAQREMGPV ASADKPTTKL SASRNDEGFS 900 
SSSQDVNGPT DGSRGELSQP TLTIQTHPYR TCDPVEMSYP RDQFQLAIRV ADLUQHTTQM 960 
KRGQGYGFKH HYEALPEGQT ASWDTAKEDB NRNKKRYGM KYDHSR VRL L VLDGDPHSD 1020 
YINANYIDGY HRPRHYIATQ GPMQETVKDF WRMIWQEN5A STVMVTNLVB VGRVKCVRYW 1080 
PDDTEVYGDI KVTUBTEPL AEYVIRTFTV QKKGYHEIRE LRLFHFISWP DHGVPCYATG 1140 
LLGFVRQVKF LNPPEAGPIV VHCSAGAGRT GCF1AIDTML DMAENEG WD IFNCVRELRA 1200 
QRVNLVQTEH QYVFVHDAJDL EACLCGNTAI PVCEFRSLYY NKRLDPQTN SSQIKDEFOT 1260 
LNIVTPRVRP EDCSIGLLPR NHDKNRSMD V LPLDRCLPFL ISVDGESSNY INAALMDSHK 1320 
QP AAFWTQH PLPNTVADFW RLVFDYNCSS WMLNEMDT A QFCMQYWFEK TSGCYGPIQV 1380 
BFVSADIDBD HHRIFRICN MARPQDGYRI VQHLQY1GWP AYRDTPPSKR SLUCWRRLE 1440 
KWQEQYDGRB GRTWHCLKG GGRSGTFCAI CSVCEMIQQQ NUDVFHIVK TLRNNKSNMV 1500 
BTLEQYKPVY BVALBYLSSF 

Scq ID NO: 99 DNA sequence 

Nndefc Acid Accession ft NMJD02988.1 

Coding sequence; 71-340 (underlined sequences correspond to start and stop codons) 

I 11 21 31 41 51 
I I t I I I 

CCGGCACGAG AGGAGTTGTG AGTTTCCAAG COCCAGCFCA CTCTG ACCAC TTCICTGCCT 60 
GCCCAGCATC AjTiAAGQGCC TTGCAGCTGC CCTCC1 1G1C CTCGTCIGCA OCATOGCCCT 120 
CTGCTCCTGT GCACAAGTTG GTACCAACAA AGAGCTCTGC TGOCTCGTCT ATACCTCCTG 180 
GCAGATTCCA CAAAAGTTCA TAGTTGACTA TTCTGAAACC AGCCCCCAGT GCCOCAAGCC 240 
AGGTGTCATC CTCCTAAOCA AGAOAGGCCG GCAGATCTGT GCTGACCCCA ATAAGAAGTG 300 
GGTCCAGAAA TACATCAGCG ACCTGAAGCT GAATGCCTOA GGGGCCTGG A AGCTGCGAGG 360 
GCCCAGTGAA CTTGGTGGGC CCAGGAGGG A ACAGGAGCCT GAGCCAGGGC AATGGCOCTG 420 
CCACOCTGG A GGCCACCTCT TCTAAG AGTC CCATCTGCTA TGCCCAGCCA CATTAACTAA 480 
CTTTAATCTT AGTTTATGCA TCATATTTCA TTTTGAAATT GATTTCTATT GTTGAGCTGC 540 
ATTATGAAAT TAGTATTTTC TCTGACA7XT CATGACATTG TCTTTATCAT CCITnXCCT 600 
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TTCCCTTCAA CTCTTOOTAC ATTCAATGCA TGGATCAATC AGTGTGATTA GCTTTCTCAG 660 
CAGACATTGT GCCATATGTA TCAAATGACA AATCTTTATT GAATGGTTTT GCTCAGCACC 720 
ACCTTTTAAT ATATTGGCAG TACTTATTAT ATAAAAGGTA AACCAGCATT CTCACTGTGA 780 
AAAAAAAAAA AAAAAAAAAA AAA 

Seq ID NO: 100 Protein sequence: - 
Protein Accession #: NPJJ02979. 1 

1 1) 21 31 41 51 
I I I I I I 

MKGLAAAIXV LVCTMALCSC AQVGTNKELC CLVYTSWQJP QKFIVDYSET SPQCPKPGV1 60 
LLTKRGRQIC ADPNKKWVQK YISDLKLNA 

Seq ID NO: 101 DNA sequence 

Nockk Acid Accessioo ft NM0155O7.2 

Coding sequence: 241-1902 (ondertined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

CCGCAGAGGA GCCTCGGCCA GGCTAGCCAG GGCGCCOCCA GCCCCTCCCC AGGOCGCGAG 60 
CGCCCCTGCC GCGGTGCCTG GCCTCCOCTC CCAGACTGCA GGGACAGCAC CCGOTAACTG 120 
CGAGTGGAGC GGAGGACCCG AGCGGCTGAG GAGAGAGGAG GCGGCGGCTT AGCTGCTACG 180 
GGGTCCX3GCC GGCGCCCTCC CGAGGGGGGC TCAGGAGGAG GAAGGAGGAC COGTGCGAGA 240 
ATGC CTCTGC CC7GGAGCCT TGCGCTCCCG CTGCTGCTCT CCTGOGTGGC AGGTGGTTTC 300 
GGGAACGCGG CCAGTGCAAG GCATCACGGG TTGTTAGCAT CGGCACGTCA GCCTGGGGTC 360 
TGTCACTATG G AACTAAACT GGCCTGCTGC TACGGCTGGA GAAGAAACAG CAAGGGAGTC 420 
TGTG AAGCTA CATGfXS AACC TGGATGTAAG TTTOOTGAGT GCGTGGGACC AAACAAATGC 480 
AGATGCTTTC CAGGATACAC CGGGAAAACC TGCAGTCAAG ATGTOAATOA GTO TOGAA TQ 540 
AAACCCOGGC CATGCCAACA CAGATGTGTG AATACACACG GAAGCTACAA GTGCTTTTGC 600 
CTCAGTGGCC ACATGCTCAT GCCAGATGCT ACGTGTGTG A ACTCTAGGAC ATGTGCCATG 660 
ATAAACTGTC AGTACAGCTG TGAAGACACA GAAGAAGGGC CACAGTGCCT GTGTCCATCC 720 
TCAGGACTCC GCCTGGCCCC AAATGGAAGA GACTGTCTAG ATATTGATGA ATGTGCCTCT 780 
GGTAAAGTCA TCTGTCOCTA CAATCGAAGA TGTGTGAACA CATTTGGAAG CTACTACTGC 840 
AAATGTCACA TTGGTTTOG A ACTGCAATAT ATCAGTGGAC G ATATG ACTG TATAGATATA 900 
AATGAATGTA CTATGGATAG CCATACGTGC AGCCACCATG CCAATTGCTT CAATAO0CAA 960 
GGGTCCTTCA AGTGTAAATG CAAGCAGGGA TATAAAGGCA ATGGACTTCG GTGTTCTGCT 1020 
ATOOCTGAAA ATTCTGTGAA GGAAGTCCTC AGAGCACCTG GTACCATCAA AGACAGAATC 1080 
AAG AAGTTGC TTGCTCACAA AAACAGCATQ AAAAAGAAGG CAAAAATTAA AAATOTTAOC 1140 
CCAGAACCCA CCAGGACTCC TACCOCTAAG GTGAACTTGC AGCCCTTCAA CTATGAAGAG 1200 
ATAGTTTOCA GAGGCGGGAA CTCTCATGGA GGTAAAAAAG GGAATGAAGA GAAAATGAAA 1260 
GAGGGGCTTC AGGATGAGAA AAGAGAAGAG AAAGCCCTGA AGAATGACAT AGAGGAGCGA 1320 
AGCCTGCGAG GAGATGTGTT TTTCCXTAAG GTGAATGAAG CAGGTGAATT CGGCCTGATT 1380 
CTGGTCCAAA GGAAAGCGCT AACTTCCAAA CTGGAACATA AAGATTTAAA TATCTCGGTT 1440 
GACTGCAGCT TCAATCATGO GATCTGTGAC TGGAAACAGG ATAGAGAAGA TGATTTTGAC 1500 
TGGAATCCTG CTGATCGAGA TAATGCTATT GGCTTCTATA TGGCAGTTCC GGCCTTGGCA 1560 
GGTCACAAG A AAGACATTGG CCGATTG AAA CTTCTOCTAC CTGAGCTGCA ACCCCAAAGC 1620 
AACTlCfUl 1 TO C1U1 11 G A TTACCGOCTO GCCGGAGACA AAGTCGGGAA ACTTCGAGTG 1680 
TTTGTGAAAA ACAGTAACAA TGCCCTGGCA TOGGAGAAGA OCACG AGTGA GGATGAAAAG 1740 
TGGAAGACAG GGAAAATICA GTTGTATCAA GGAACTGATG CTACCAAAAG CATCATTTTT 1800 
GAAGCAGAAC GTOGCAAGGG CAAAACCGGC GAAATCGCAG TGGATGGOGT CTTGCTTGTT 1860 
TCAGGCTTAT GTDCAGATAG (XTTTTATCT GTGGATGA CT O AA TGTTACT ATCTTTATAT 1920 
TTCA C1 1 IU1 ATGTCAGTTC C C i TiG l 111 1' 1 1UATATTOC ATCATAOGAC CTCTOOCATT 1980 
TTAGAATTAC TAGCTGAAAA ATTGTAATGT ACCAACAGAA ATATTATTGT AAGATGCCTT 2040 
TCTTGTATAA GATATGCCAA TATTTGCTTT AAATATCATA TCACTGTATC TTCTCAGTCA 2100 
TTTCTGAATC TTTOCACATT ATATTATAAA ATATGGAAAT GTCAGTTTAT CTCCCCTCCT 2160 
CAGTATATCT GATTTGTATA AGTAAGTTG A TGAGCTTCTC TCTACAACAT TTCTAGAAAA 2220 
TAGAAAAAAA AGCACAGAGA AATGTTTAAC TGTTTGACTC TTATGATACT TCTTGGAAAC 2280 
TATGACATCA AAGATAG ACT TTTGCCTAAG TGGCTTAGCT GGGTCTTTCA TAGOCAAACT 2340 
TGTATATTTA AATTCTTTGT AATAATAATA TCCAAATCAT CAAAAAAAAA AAAAAAAA 

Seq ID NO: 102 Protein sequence: 
PmtdnAccesskmft NP_0563223 

1 11 21 31 41 51 
I I I I I I 

MFLPWSLALP IXLSWVAGGF GNAASARHHG LLA&ARQFGV CHYGTKLACC YGWRRNSKGV 60 
CEATCEPOOC FGECVGPNKC RCFPGYTGKT CSQDVNECGM KPRPCQHRCV NTHGSYKCFC 120 
LSGHMLMPDA TCVNSRTCAM INCQYSCEDT BEGPQCLCPS SGLRLAPNGR DCLDIDECAS 180 
GKVKPYNRR CYNTFGSYYC KCHIGFELQY BGRYDCIDI NECTMDSHTC SHHANCFNTQ 240 
GSFKCXCXQG YKGNGLRCSA IPENSVKEVL RAPGTTKDRJ KKLJLAHKNSM KKKAKJKNVT 300 
PBK 1 K 1 K 1 PK VNLQPFNYEB IVSRQGNSHG GKKGNEEKMK EGLEDEKREE KALKNDlEER 360 
SLRGDVFFPK VNEAGEFGLI LVQRKALTSK LEHKDLNISV DCSFNHGICD WKQDREDDFD 420 
WNPADRDNAI GPYMAVPALA GHKKDiGRLK LXXPDLQPQS NRXLFDYRL AGDKVGKLRV 480 
FVKNSNNALA WEKTTSEDEK WKTGK1QLYQ GTDATKSHF EAERGKGKTG EIAVDGVLLV 540 
SGLCPDSLLS VDD 

Seq ID NO: 103 DNA sequence 

393 



WO 02/059377 



Nucleic Acid Accession #: NM0O156S.1 

Coding sequence: 67-363 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

GAGACATTCC TCAATTGCTT AG ACATATTC TGAGCCTACA GCAGAGGAAC CTCCAGTCTC €0 
AGCACCATGA ATCAAACTGC GATTCTGATT TGCTCCCTTA TCTTTCTGAC TCTAAGTGGC 120 
A1TCAAGGAG TACCTCTCTC TAGAACCGTA CGCTCTACCT GCATCAGCAT TAGTAATCAA 180 
CCTGTTAATC CAAGGTCTTT AGAAAA ACTT GAAATTATTC CTGCAAGCCA ATTTTGTCCA 240 
CGTGTTGAGA TCATTGCTAC AATGAAAAAG AAGGGTGAGA AGAGATGTCT GAATCCAGAA 300 
TCGAAGGCCA TCAAGAATTT ACTGAAAGCA GTTAGCAAGG AAATGTCTAA AAGATCTCCT 360 
TAAAACCAGA GGGG AGCAAA ATCGATGCAG TGCTTCCAAG GATGGACCAC ACAGAGGCTG 420 
CCTCTCCCAT CACTTCCCTA CATGGAGTAT ATGTCAAGCC ATAATTGTTC TTAGTTTGCA 480 
GTTACACTAA AAGGTGACCA ATGATGGTCA CCAAATCAGC TGCTACTACT CCTGTAGGAA 540 
GGTTAATGTT CATCATCCTA AGCTATTCAG TAATAACTCT ACCCTGGCAC TATAATGTAA 600 
GCTCTACTGA GGTGCTATGT TCTTAGTGG A TGTTCTOAOC CIGCTTCAAA TATTTCOCTC 660 

AccrrrcccA tcttocaago gtactaagga atctttctgc TrrGGGGrrr atcagaattc 720 

TCAGAATCTC AAATAACTAA AAGGTATGCA ATCAAATCTO CTTTTTAAAG AATGCTCTTT 780 
ACTTCATGGA CTTCCACTGC CATCCTCCCA AGGGGCCCAA ATrCTTTCAG TGGCTACCTA 840 
CATACAATTC CAAACACATA CAGGAAGGTA GAAATATCTO AAAATGTATG TGTAAGTATT 900 
CTTATTTAAT GAAAGACTGT ACAAAGTATA AGTCTTAGAT GTATATATTT CCTATATTGT 960 
TTTCAGTGTA CATGG AATAA CATGTAATTA AGTACTATGT ATCAATG AGT AACAGGAAAA 1020 
TTTTAAAAAT ACAGATAGAT ATATGCTCTG CATGTTACAT AAGATAAATG TGCTGAATGG 1080 
TTTTCAAATA AAAATGAGGT ACTCTCCTGG AAATATTAAG 

SeqlDNCh 104 Protem seqnence: 
Protein Accession NP_00 1556.1 

1 11 21 31 41 51 

MNOTAILICC UFLTLSGIQ G VPLSRTVRC TOSISNQPV NPRSLEKL0 IPASQFCPRV 60 
EUATMKKKG EKRCLNPESK AIKNLUCAVS KEMSKRSP 

Seq ID NO: 105 DNA sequence 

Nucleic Acid Accession jfc NM 015068.1 

Coding B f qnrmr* 1 170-2243 (underlined sequences correspond to start and stop codons) 
] 11 21 31 41 51 

OTAACAACCG TCAOCCTQGG TCCCGACTGC CCACCIT3CTC CTCCTCCCCC TCOCCCCAAC 60 
AACAACAACA ACAACAACTC CAAGCACACC GGCCATAAGA GTGCGTGTGT CCCCAACATG 120 
ACCGAACGAA GAAGGGACGA GCTCTCTGAA GAGATCAACA ACTTAAGAGA GAAGGTCATO 180 
AAGCAGTCGG AGGAG AACAA CAACCTGCAG AGOCAGGTGC AOAAGCTCAC AOAGGAGAAC 240 
ACCACCCTTC GAGAGCAAGT GGAACCCACC CCIGAGGATG AGGATGATGA CATCGAGCTC 300 
CGCXKjTGCTG CAGCAGCTGCTGCCCCACCC CCTCCAATAG AGGAAGAGTG CCCAGAAGAC 360 
CTCCCAG AGA AGTTCGATGG CAACCCAGAC ATGCTGGCTC CTTTCATGGC CC AGTGC CAG 420 
ATCTTCATGG AAAAG AGCAC CAGGGATTTC TCAGTTGATC GTGTCCGTGT CTGCTTCGTG 480 
ACAAGCATGA TO ACCGGCCG TGCTGCCCGT TGGGCCTCAG CAAAGCTGGA GCGCTCCCAC 540 
TACCTGATGC ACAACTACCC AGCTTTCATG ATGGAAATGA AGCATGTCTT TGAAGACCCT 600 
CAG AGGCGAG AGGTTGCCAA ACGCAAGATC AGACGCCTGC GCCAAGGCAT GGGGTCTGTC 660 
ATCOACTACT CCAATGCTTT CCAG ATGATT GCCCAGG ACC TGGATTGGAA CGAGCCTGCG 720 
CTGATTGACC AGTACCACGA GGGCCTCAGC GACCACATTC AGGAGGAGCT CTCCCACCTC 780 
GAGGTCGCCA AGTOGCTGTC TGCTCTGATT GCGCAGTGCA TTCACATTGA GAGAAGGCTG 840 
GCCAGGGCTG CTOCAGCTCG CAAGCCACGC TCGCCACCCC GGGCGCTGGT GTTGCCTCAC 900 
ATTGCAAGCC ACCACCAGGT AGATCCAACC GAGCCGGTGG GAGGTGCCCG CATGCGCCTG 960 
ACGCAGGAAG AAAAAGAAAG ACGCAGAAAG CTGAACCTGT GCCTCTACTG TGGAACAGGA 1020 
GGTCACTACG CTGACAATTG TCCTGCCAAG GCCTCAAAGT CTTCGCCGGC GOG AAACTCC 1080 
CCOQCCCOGC TOTAGAGGOA OCTTCAGCGA CCGGGCCAGA AATAATAAGG TCCCCACAAG 1140 
ATGATGCCTC ATCTCCACAC TTGCAAGT GA TGC TCCAGAT TCATCTTCCG GGCAGACACA 1200 
CCCTGTTCGT CCGAGCCATG ATCGATTCTG GTGCTTCTGG CAACTTCATT GATCACG AAT 1260 
ATGTTGCTCA AAATGGAATT CCTCTAAGAA TCAAGGACTG GCCAATACTT GTGGAAGCAA 1320 
TTGATOGGCG CCCCATAGCA T0GGGOCCAO nOTCCACGA AACTCACGAC CTGATAGTTG 1380 
ACCTGGGAGA TCACCG AGAG GTGCTGTCAT TTGATGTGAC TCAGTCTCCA TTCTTCCCTG 1440 
TCGT0CTAOQ GGTTCGCTGG CTGAGCACAC ATGATCCCAA TATCACATGG AGCACTCOAT 1500 
CTATCGTCTT TG ATTCTGAA TACTGCOGCT ACCACTGCCG GATGTATTCT CCAATACCAC 1560 
CATCGCTCCC ACCACCAGCA CCACAACCGC CACTCTATTA TCCAGTAGAT GG ATACAGAG 1620 
riTACCAACC AGTGAGGTAT TACTATGTCC AGAATGTGTA CACTCCAGTA GATGAGCACG 1680 
TCTACCCAGA TCACCGCCTC GTTGACCCTC ACATAGAAAT GATACCTGOA GCAC ACAGT A 1740 
TICCCAGTGO ACATGTGTAT TCACTGTCCG AACCTGAAAT GGCAGCTCTT CGAGATTTTG 1800 
TOGCAAG AAA TGTAAAAGAT GGGCTAATTA CTCCAACGAT TGCACCTAAT GGAGCCCAAG 1860 
TTCTCCAGGT GAAG AGGGGG TGGAAACTGC AAGTTTCTTA TGATTGCCGA GCTCCAAACA 1920 
ATTTTACTAT CCAGAATCAG TATCCTOGCC TATCTATTCC AAATTTAGAA GACCAAGCAC 1980 
ACCTGGCAAC GTACACTGAA TTCGTACCTC AAATACCTGG ATACCAAACA TACCCCACAT 2040 
ATGCCGCGTA CCCGACCTAC CCAGTAGGAT TCGCCTGGTA CCCAGTGGGA CGAGACGGAC 2100 
AAGGAAGATC ACTATATGTA CCTGTGATGA TCACTTGGAA TCCACACTGG TACCGCCAGC 2160 
CTCCGGTACC ACAGTACCCG CCGCCACAGC CGCCGCCTCC ACCACCAGCA CCGCCGCCGC 2220 
CTCCATCTTA CAGTACCCTG TAAATACCTG TCATGTCCTT CAGGATCTCT GCCCTCAAAA 2280 
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TTTATTCCTQ TTCAGCTTCT CAATCAGTGA CTGTGTOCTA AATTTTAGGC TACTOTATCT 2340 
TCAGGCCACC TGAGGCACAT CCTCTCTG AA ACGGCTATOG AAGGTTAGCO CCACTCTGGA 24C0 
CTOGCACACA TCCTAAAGCA CCAAAAG AOC TTCAACATTT TCTGAGAGCA ACAGAOTATT 2460 
TGCCAATAAA TOATCTCTCA TTTTTCCACC TTGACTOCCA ATCTAACTAA AATAATTAAT 2520 
AAGTTTACTT TCCAGCCAGT CCTGGAAGTC TGGGTTTTAC CTGCCAAAAC CTCCATCACC 2580 
ATCTAAATTA TAGGCTGOCA AATTTGCTGT TT AACATTTA CAGAG AAGCT GATA CAAACG 2 640 
CAGGAAATGC TGATTTCTTT ATGGAGGGGG AG ACGAGGAG GAGGAGGACA TGACTTTTCT 2700 
TGCGOTTTCG GTAOOCTCTT TTTAAATCAC TGGAGGACTG AGGCCTTATT AAGGAAGCCA 2760 
AAATTATOGG TGCAGTGTGG AAAGGCTTCC GTG ATCCTCT CGCTGCACCC TTAGAAACIT 2820 
CACCGTCTTC AAACTOCATT TCCATGGTTC TGTTAATTCT CAAGGAGCAG CAACTCGACT 2880 
GGTTCTCCCA GOAGCAGGAA AAACCCTTOT GACATOAAAC ATCTCAGGCC TO AAAAG AAA 2940 
GTGCTCTCTC AGATGGACTC TTGCATGTTA AGACTATGTC TTCACATCAT GGTGCAAATC 3000 
ACATGTACCC AATGACTCCG GCTTTGACAC AACACCTTAC CATCATCATG CCATGATGGC 3060 
TTCCACAAAO CATTAAACCT GGTAACCAGA GATTACTGGT GGCTCCAGCG TTGTTAGATG 3120 
TTCATGAAAT GTGACCACCT CTCAATCAGC TTTGAGGGCT AAAGAGTAGC ACATCAAAAG 3180 
GACTOCAAAA TOOCATACCC AACTCTTAAG AGATTTGTCC TGGTACTTCA GAAAGAATTT 3240 
TCATGAGTGT TCTTAATTGO CTGGAAAAGC ACCAGCTGAC GTTTTGGAAG AATCTATCCA 3300 
TGTGTCTGCC TCCATATGCA TCTGGGCATT TCATCTTCAG TCCCCTCATT AGACTQTAGC 3360 
ATTAGGATGT GTGGAGAG AG GAGAAATGAT TTAGCACCCA GATTCACACT CCTATGCCTG 3420 
GAAGGGGGAC ATCTTTG AAG AAGAGGAATT AGGGCTGTGG ACACTGTCTT GAGGATGTGG 3480 
ACTTCCTTAG TGAGCTCCAC ATTACTTGAT GGTAACCACT TCAAAAGGAT CAGAATCCAC 3540 
GTAATQAAAA AOOTOCCTCT AGAOGATOO A GCTOATOTGA AGCTGCCAAT GGATG AAAAG 3600 
, CCTCAGAAAG CAACTCAAAG GACTCAAAGC AACGGACAAC ACAAGAGTTG T CTTCAGCC C 3660 
AGTCACACCT CTGATGTCCC CTGGAAGdT iG 1 G CTAACC TGGGACTGCC TCACTTCCTT 3720 
TAGCCTGGTC CCTTGCTACT ACCTTGAACT GTTTTATCTA ACCTCTCTTT TTCTGTITAA 3780 
TTCTTTGCTA CTGOCATTGA COCTGCTGCA GGATTTOTGT CATTITOCTG OCTGGTTGCT 3840 
G AGACTCCAT TTTGCTGCCA CACACAGAGA TGTAAGAGGC AGGCTTTAAT TGCCAAAGCA 3900 
CAGTTTGAGC AGTAGAAAAC AACATGGTGT ATATCTCAAA TTGCCTGACA TGAAGAGGAG 3960 
TCTAAOGGTG AAGTTTCACT TTTCATCAGC ATCATCTTTC ACATGTTCAT TATCATOC GC 4020 
TCTTATTCTT GCATGTTTAA ACACTTAAAA TTTTTAGTAT AATTTTTAGT GTGTTTTGAA 4080 
GTGGTGACTA GGCTTTCAAA AACTTCCATT GAATTACAAA GCA CTATO CA tfilCl lATTG 4140 
TTAAACTAAG TAAAAATGAT AAGTAACATA GTGTAAAATA TTCCTTTACT GTGAACTTCT 4200 
TACAATGCTG TGAATOAGAG GCTCCTCAGA ACTOOAGCAT TTGTATAATA ATTCATCCTG 4260 
TTCATCITCA ATTTTAACAT CATATATAAT TTCAATTCTA TCAATTGGGC CTTTAAAAAT 4320 
CATATAAAAG GATATAAAAT TTGAAAAGAG AAACCTAATT GGCTATTT AA TCC AAAACAA 4380 
UHinill TOCTTCAATO QAATCAGAAA QCTTQTCAAT CACTCATGTG TTT TAQAGTA 4440 
ATTACTTTTA AAATGGTGCA 1 1 1U10CI JC TGAACTATTT TGAAGAGTCA CI 1C1UU 1 A 4500 
CCTCAAGTAT CAATTCATCC TCCATACATT TGAATTCAAG TTGTTTTTTG TCAAATTTAC 4560 
AOTICTCAAT TCATCTTCAA GCTGCAGGGT OCCTAOAAAT GGQCCOTTGT CTOTA GOCCT 4620 
GGCATGTGCA CAGGGACATT TGCCACCACT GCAAGCAAAA GTCTGGAGAA GTTCAOCAAC 4680 
GACAAGAACG ATTAGGGAAA ATATGCTGCT GTGGGTTAAC AACTCAGAAA GTOCCTGATC- 4740 
CACATTTGGC TCITT ACT AA AGCTTOTOAT TAACTTTTTG GCAGTGTGTA CTATGCTCTA 4800 
TTGCTATATA TGCTATCTAT AAATGTAGAT GTTAAGGATA AGTAATTCTA AATTTATTAT 4860 
TCTATAGTTT TGAAGTTTGG TTAAGTTTCC TTTCACTCAA TTGATTTATT TTGTTGTTAA 4920 
TCAAATTTAT GTTAATTGG A TCCTTTAAAT TTTTTTTGGC ATTTTCCAAC AAAAATGGCT 4980 
TTATTCATAA GAAAGG AAAA AAATCAATGG AATTTGATAT CTAAAGAAGT TAGAAAGGGA 5040 
GCAAAATAAA AAACATAAAG GAGATAGATG AATTAGTAAG CAAATCAGTA GTCGAGTTTT 5100 
TCAAACTGGC AAAATTAATT AATTG ACTTT TAGCCCAAAT TTACATTGTT AATTAAAXCA 5160 
AGAAGGAAGA AGATCTAAGA GCTCCCATTG ATAGQCAAGC CTAGAGAGAA CTAGGTAAAT 5220 
TTATCATGCT AGGATATTGA AACACAGAAA GTTTACATAC ATTTATGAAG GGTCAAXTTA 5280 
GTTTGGACAG TGAGGTATTT GTCTTAGTGG AAAAAAGGAG AATTAGTCTG ATCAAATCGT 5340 
GAAGTAATAC AGTGAACTTG CAGGTGCACA AAATAAGAGG GCCACATCTA TATGGTGCAG 5400 
TCTGG AATTC TGTTTAAGTT TGTAGOTACC TCTTGOACTT CTGAATTGAT CCAGTTGTCA 5460 
TCCACCACAG ACATCTCACA TCAGATACAG ACAGTTCCAA GATTGACAAC AG AGAAC AAC 5520 
CTGCTGGAAA QACCTGGGCA GAAATGGAGA GCCCTGCGG Q AAOCATG CTA CATTTTCATC 5580 
TAAAGAOAOA ATOCACATCT GATGAOACTO AAAUT1UI 1 i tf ITUl j 1 1A G ATTOTAGAAT 5640 
GGTATTGAAT TGGTCTGTGG AAAATTGCAT TGCTTTTATT TCTTTGTGTA ATCAAGT TTA 57 00 
AGTAATAGGG GATATATAAT CATAAGCATT TTAGGGTGGG AGGGACTATT AAGTAATTTT 5760 
AAGTGGGTGO GQTTATTTAG AATGTTAGAA TAATATTATG TATTAGATAT CGCTATAAGT 5820 
GGACATGCGT ACTTACTTGT AACCCTTTAC CCTATAATTG CTATCCTTAA AGATTTCAAA 5880 
TAAACTCGGA GGGAACTGCA GGGAGACCAA CTTATTTAGA GCGAATTGGA CATGGATAAA 5940 
AACCCCAGTG GGAGAAAGTT CAAAGGTGAT TAGATTAATA ATTTAATAGA GGATGAGTGA 6000 
CCTCTG ATAA ATTACTGCTA GAATGAACTT GTCAATOATO GATGGTAAAT TTTCATGGAA 6060 
GTTATAAAAG TGATAAATAA AAACCCTTGC TTTTACCCCT GTCAGTAGCC CTCCTCCTAC 6120 
CACTGAACCC CATTGCCCCT ACCCCTCCTT CTAACTTTAT TGCTGTATTC TCTTCACTCT 6180 
ATATTTCTCT CTATTTGCTA ATATTGCATT GCTGTTACAA TAAAAATTCA AT AAAOATTT 6240 
AGTGGTTAAG TGC 

Seg ID NO: 106 Protrin seocencc: 
Proton Acccsskm ft NP055883.1 

1 11 21 31 41 51 

MTERRRDELS EeSkNLREKV MKQSBBNNNL QSQ VQKLTEE NTTLREQVEP TPBDEDDD1E 60 
LRGAAAAAAP PPPlEEKPE WJ^EKHXjNP DMLAI^MAQC QIFMEKCTRD FSVDRVRVCF 120 
VTSMMTGRAA RWASAKLEBS HYLMHNYPAF MMEMKHVFBD PQRREVAKRK IRRLRQGMGS 180 
VIDYSNAFQM IAQDLDWNEP ALEDQYHEGL SDHIQEELSH LEVAKSLSAL IGOCDflERR 240 
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LARAAAARKP RSPPRALVLP HIASHHQVDP TEPVGOARMR LTQEEKERRR KLNLCLYCGT 300 
OOHYADNCPA KASKSSPAGN SPAPL 

SeqIDNO: 107 DNA sequence 

Nudefe Add Accession fc NM0O3679.1 

Coding sequence: 47- 1 507(uodetiuied sequences correspond to start and stop codons) 

t II 21 31 41 SI 
I I t I I I 

GGCACGAGCA GAAGCAACAA TAATTGTGAA AAATACTTCA GCAGTTATOQ ACTCATCTOT 60 
CATTCAAAGG AAAAAAGTAG CTGTCATTGG TGGTGGCTTG GTTGGCTCAT TACAAGCATG 120 
CTTTCTTGCA AAGAGGAATT TCCAGATTGA TGTATATGAA GCTAGGGAAG ATACTCGAOT 180 
GGCTACCTTC ACACGTGGAA GAAGCATTAA CTTAGCCCTT TCTCATAGAG GACGACAAGC 240 
CTTGAAAGCT GTTGGCCTGG AAGATCAGAT TGTATCCCAA GGTATTCOCA TGAGAGCAAG 300 
AATOATCCAC TCTCTTTCAG GAAAAAAGTC TGCAATTCCC TATGGGACAA AGTCTCAGTA 3<S0 
TATTCTTTCT GTAAGCAGAG AAAATCTAAA CAAGGATCTA TTGACTGCTG CTGAGAAATA 420 
COCCAATGTG AAAATGCACT TTAACCACAG GCTGTTGAAA TGTAATCCAG AGGAAGGAAT 480 
GATCACAGTG CTTGG ATCTG ACAAAGTTCC CAAAGATGTC ACTTGTG ACC TCATTGTAGG 540 
ATGTGATGGA GCCTATTCAA CTGTCAGATC TCACCTGATG AAGAAACCTC GCTTTGATTA 600 
CAGTCAGCAG TACATTCCTC ATGGGTACAT GGAGTTGACT ATTCCACCTA AGAACGGAGA 660 
TTATGCCATG GAACCTAATT ATCTGCATAT TTGGCCTAGA AATACCTTTA TGATGATTGC 720 
ACTTCCTAAC ATG AACAAAT CATTCACATG TACTTTOTTC ATGCCCTTTG AAGAGTTTGA 780 
AAAACTTCTA AOCAGTAATG ATGTGGTAGA TTTCTTCCAG AAATACTTTC CGGATGCCAT 840 
CCCTCTAATT GGAGAGAAAC TCCTAGTGCA AGA1 ITCT1C CTGTTGCCTG CCCAGCCCAT 900 
GATATCTGTA AAGTGCTCTT CATTTCACTT TAAATCTCAC TGTGTACTGC TGGGAGATGC 960 
AGCTCATGCT ATAGTGCCGT TTTTTGGGCA AGO AATGAAT GCGGGCTTTG AAGACTGCTT 1020 
GGTATTTGAT GAGTTAATGG ATAAATTCAG TAACGACCTT AG 1 1 1GTGTC TTCCTGTGTT 1080 
CTCAAGATTG AGAATCCCAG ATGATCACGC GATTTCAGAC CTATCCATGT ACAATTACAT 1 140 
AGAG ATGCG A GCACATGTCA ACTCAAGCTG GTTCATTTTT CAGAAGAACA TGG AGAGATT 1200 
TCTTCATGCG ATTATGCCAT CGAOCTTTAT CXXTTCTCTAT ACAATGGTCA CTTTTTCCAG 1260 
AATAAGATAC CATGAGGCTG TGCAGCGTTG GCATTGGCAA AAAAAGGTGA TAAACAAAGG 1320 
ACTCTTTTTC TTGGGATCAC TGATAGCCAT CAGCAGTAOC TACCTACTTA TACACTACAT 1380 
GTCACCACG A TCTTTOCTCT GCTTGAGAAG ACCATGG AAC TGG ATAGCTC ACTTOCGGAA 1440 
TACAACATGT TTCCCCGCAA AGGCCGTGGA CTCCCTAGAA CAAATTTCCA ATCTCATTAG 1500 
CAGGTGATAG AAAGGTTTTG TGGTAGCAAA TGCATG ATTT CTCTGTGACC AAAATTAAGC 1560 
ATG AAAAAAA TGTTTCCATT GCCATATTTG ATTCACTAGT GGAAG ATAGT GTTCTGCTTA 1620 
TAATTAAACT GAATGTAGAG TATCTCTGTA TGTTAATTGC AATTACTGGT TGGGGGGTGC 1680 
ATTTTAAAAG ATGAAACATG CAGCTTCCCT ACATTACACA CACTCAGGTT GAGTCATTCT 1740 
AACTATAAAA GTOCAATGAC TAAGATCCTT CACTTCTCTG AAAGTAAGGC CCTAGATGCC 1800 
TCAGGGAAG A CAGTAATCAT GCCTTTTCTT TAAAAGACAC AATAGGACTC GCAACAGCAT 1860 
TGACTCAACA CCTAGGACTA AAAATCACAA CTTAACTAGC ATGTTAACTG CACTTTTCAT 1920 
TACGTGAATG GAACTTACCT AAOCACAGGG CTCAG ACTTA CTAG ATAAAA CCAG AAATGG 1980 
AAATAAGGAA TTCAGGGGAG TTCCAGAGAC TTACAAAATG AACTCATTTT ATTTTCCCAC 2040 
CTTCAAATAT AAGTATTATC ATCTATCTGT TTATCGTCTA TCTATCTATC ATCTATCTAT 2100 
CTATCTATCA TCTATCTATC TATCTATCTA TCTATCTATC TATCTATCTA TCTATCTATC 2160 
TCTATTTATT TATGTATTTA GAGATCAGGT CTCACTCTGT TGACCAGGCT GGAGTGCAGT 2220 
GGTGAGATCT GGGTTCACTG CAACCTCTGC CTCCTGGGCT CAAGCAATCC TCCCACTTCA 2280 
GCCTCCCAAA TAGCTGGGGC TACCATGGTA TTTTTCAGTA GAGACCGGGT CTTGCCATGC 2340 
TGCCCAGGCC AGTCTCAAAC TCCTGGCCTC ATGTGATCTG CCCACCTCAG CCTCCCAAAG 2400 
TACAGGGATT AGAGTTGTGA CMXACCGCTG CCAGCCCAGA GTTACCCTCT AAAGATAAGA 2460 
AAAAGGCTAT TAATATCATA CTAAGTGAAG GACAGGAAAG GGTTTTATTC ATAAATTAAA 2520 
TGTCTACATG TGCCAGAATG GAAAGGAAAC AAGGGGAGAC AACTTTTATA GAAATACAAA 2580 
GCCATTACTT TATTCAATTT CAGACCCTCA GAAGCAATTT ACTAATTTAT TCTTCG ACTA 2540 
CATACTGCAG CAGAACCAGC AATACACTTG ATTTTTAAAA GCACATTTAG TGAAATGTTT 2700 
TCTTTGGTTC ATCCTTCTTT AACAGGCTGC TGAGTCACTC AGAAATCCTT CAAACATGAT 2760 
TAATTATGAA GATG AAACAC TAGAGTCATA TAAGAAATAA AAATTGGGCA ATAAAATAAA 2820 
ATGATTCAGT GTTTCTTTTC TATATTGTCA ATGAAAACCT TGAGTTCTAA TAATCCATGT 2880 
TCAGTTTGTA GGGAAAGAAA AAATAATTTT TCCTTCTACC CACTTTAGGT TCCTTGGCTG 2940 
GGGCCCCTAT AACAAAAGAC AGATTGACAA GAGAAAAACA AACATAAATT TATTAGCGGG 3000 
TATATGTAAT ATATATGTGG GAAATACAGG GGAATGAGCA AATCTCAAAG AGCTGGCGTC 3060 
TTAGAACTCC CTGGCTTATA TAGCATCGAC AAAGAACAGT AAATTTTTAG AGAAACAACA 3120 
AAACAAAG AA AAAG AGCTTT G AGTCTGTAG GGGCAGCAAT TTGGGGGAAG CAAAT ATATG 3180 
GGAGTTTGCC TTGTAGATTC CTCTGGTGGT GGTCTCCAGG CTGACAAGGA TTCAAAGTTG 3240 
TCTCTGAAAC TOCT C l TtXIT CATACTGCAC ATATAAAACG TCI 1 1 WIT i CCAACAAGAO 3300 
GATTTCTTTT TCATTCTAGA ATTATCTCCT TGATAACTTG ATCAGATATA GGACATGACA 3360 
CTGAATAGAO TCCAACAGTA CAAAAAAAAT TCAGTATGTT CTAGCTACTT CACACATGTG 3420 
TACGCG ACAG TTATTTTTAC AGTAAGGTAT TTTCGAGAAA AATGCATTAC GTGTTTTGGA 3480 
AAATAG AGTA ATTTAAAAAA TATATTTGAA ATGAAAATCT CCAACACATT AGAAGATGAT 3540 
GATGTTAGAT GCCCATCGTG TGCCACAAGT GGTTTTTTCA TTATGTAAAG CACCCGTTGA 3600 
ATTAAAAGAA 111U11 HTU 'L ICAACCTCTTCCTOAGGCC CAAGAGCATA TGGGCAATTC 3660 
GGATTTCCTG CTGGACCACA AGGTTCTGTT GATATTACAT AG AAACGGGT ATTCCAGACA 3720 
CTTCTTATGA TGAAAGTCCA AAAGTGGCAT CCAATTTAAG GCCCCATCTT TCGTTGCCAT 3780 
TCTTCATTCC TACAAAGGAC GAACTTGGAT TACATCAACT TTGGACCCAT TGGTTTTGTC 3840 
GCTGTCGTCA ACTGACAGTG ATTCACCACT GGTGATGATA AAAATGATGG AAGAAGAGTT 3900 
GAAAGTCACT TTTTTCTTTG GCCrGTCCCC ATCTTTCTGT GACATCACAA TGGGTCTGAT 3960 
CTGCATTTCA CTTCCAGCTG CTGGTAGGTC TTTAGCAGGC CTCTGGCACC TCAGCAGTCG 4020 
G AGGCACAG A AGCTGCAAAA GGGATCTTCG AAACTOGGCA GAG AAAAAAT AAAGTGO AAT 4080 
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ATTAAGTAAA AGTTGGGCAC TAATCTGGAT TAACATTCGA GGAAATCAGT TGAGCTGATT 4140 
TAAQTTOTTT TTTGTTTGTT AGCAGGTGTO GATGTGGGGT TATGTGGTCA TGCTCAGATC 4200 
TACCTAAATC ACCCCAGAGC TTTATGTCTT TTATTCATTC TAAATCTTAT TAACCGG AAT 4260 
ATGTAGGACC ATTTCAATAC CTTGTAATCC TCCAAGCTTC AATCTGCACA CACTTTCTAT 4320 
GAGGGCAGGT ACAACTATTA AGAGATTTTG AACATTAAGT TAGTCCACAA ATATTCAGTG 4380 
GOCATCTACT AGGTGACAGC CACTGTGCTA TAATTAGAGA CTTTTTACTA TAAGCATCAA 4440 
AAACAGATAA GGCTCTTCCT GGCAGAGTTT ACAGCCTGOT GTACTTGCTA ATGTCTCTTT 4500 
AATTAGGTGA AGAATTTTTT TTTTCTATCG AAATTACTAA TCAGTTGGGG AAAAAAATAC 4560 
TATAGCAGAC AGCACTAATG TCATCAACAA ACATTGTTCT TCTCOGTGTC CTGGGTACAA 4620 
CATOGAATAA TATTTCTTOG CCTOCTTTCC GCTTCTCCTC TCTGCTGTTC CTCTCTACAA 4680 
GAACCTGGGA GGGCAACGOC TAAAGATCAT AATATCACAA TGGAAGGAAC CTAGATTCCT 4740 
AAATG ACTGC ATAGG ACAGA TCCCATCTCC TOCACCCAAT ACATTATTAG ACTGAACTGT 4800 
GAOCTGAAAT GAGCAATAAA CTCTGTATTA ATTCACTGAA ATGTTGGGGT TGCTTGTTAT 4860 
AGTAGTCGGT CCATCATGAC CAGTAAAACA TAAATCAAAA GTTAATGTAA TTGTTATCCC 4920 
ATTATTTAGA GCGAAATAAA TGTTGAATAT ATGGACTTTC TCAGATTAGG AAATACCAAT 4980 
T AAAAATATA ATAAATAGCT 

SeqIDNO: 108 Protein scqaence: 

Ptntcin Accession ft NP_003670.1 1 

1 11 21 31 41 51 
I I I I I I 

MDSSV1QRKK VAVIGGGLVG SLQACFLAKR NFQIDVYEAR EDTRVATFTR GRSINLALSH 60 
RGRQALKAVG LEDQIVSQGI FMRARMIHSL SGKKSAEPYG TKSQYILSVS RENLNKDLLT 120 
AAEKYPNVKM HPNHRLLKCN PEEGMTTVLG SDKVPKDVTC DUVGCDGAY STVRSHLMKK 180 
PRFDYSQQY1 PHGYMELTIP PKNGDYAMEP NYLHIWPRNT FMMIALPNMN KSFTCTLFMP 240 
FEEFEKLLTS NDWDFFQKY FPD AIPLIGH KLLVQDFFLL P AQPMISVKC SSFHFKSHCV 300 
LLGDAAHATV PFFGQGMNAG FEDCLVFDEL MDKFSNDLSL CLPVFSRUU PDDHAISDLS 360 
MYNYIEMRAH VNSSWFIFQK NMERFLHAIM PSTFIPLYTM VTPSRIRYHE AVQRWHWQKK 420 
VINKGLFFLG SUAISSTYL LIHYMSPRSF LCLRRPWNW1 AHFRNTTCFP AKAVDSLEQI 480 
SNLKR 

SeqlDNCh 109 DNA sequence 

Nucleic Acid Accession #: NM_0061 15.1 

Coding sequence: 236-1765 (underlined sequence* c o rre spon d to start and stop codecs) 
1 11 21 31 41 51 

GCTTCAGGGT AC^CTCCCC CGCAGCCAGA AGCCGGGCCT GCAGCCCCTC AGCACCGCTC 60 
CGGGACACCC CACCCGCTTC CCAGGCGTGA CCTGTCAACA GCAACTTCGC GGTGTGGTGA 120 
ACTCTCTGAG GAAAAAOCAT TTTGATTATT ACTCTCAGAC GTGCGTGOCA ACAAGTGACT 180 
GAG ACCTAGA AATCCAAGCG TTGGAGGTCC TGAGGCCAGC CTAAGTOGCT TGAAAATjQGA 240 
ACGAAGGCGT TTGTGGGGTT CCATTCAG AG CCGATACATC AGCATGAGTG TGTGGACAAG 300 
CCCACGGAGA CTTOTGGAGC TGGCAGGGCA GAGCCTGCTO AAOOATOAGG CC CTGGCC AT 360 
TGOCGCCCTG GAGTTGCTGC CCAGGG AGCT CTTCCCGCCA CTCTTCATGG CAGCCTTTGA 420 
CGGGAGACAC AGCCAGAOCC TGAAGGCAAT GGTGCAGGOC TGGCCCTTCA GCTGCCTCGC 480 
TCTGGGAGTG CTGATGAAGG GACAACATCT TCACCTGGAG ACXTTCAAAG CTGTGOTGA 540 
TGGACTTGAT GTCCTDCTTO CCCAGGAGGT TOGCCCCAGG AGGTGGAAAC TTCAAGTGCT 600 
GGATTTACGG AAGAACTCTC ATCAGGACTT CTGGACTGTA TGGTCTGGAA ACAGGGCCAG 660 
TCTGTACTCA TTTCCAGAGC CAGAAGCAGC TCAGCCCATG ACAAAGAAGC GAAAAGTAGA 720 
TCGTTTGAGC ACAGAGGCAG AGCAGCCCTT CATTCCAGTA GAGG7GCTCG TAGACCTGTT 780 
CCTCAAOGAA GGTGCCTGT3 ATGAATTOTT CTCCTACCTC ATTGAGAAAG TGAAGCGAAA 840 
QAAAAATGTA CTACGCCTGT GCTGTAAGAA GCTGAAGATT TTTGCAATGC CCATGCAGGA 900 
TATCAAGATG ATCCTGAAAA TGGTGCAGCT GG ACTCTATT GAAGATTTCG AAGTGACTTG 960 
TACCTOGAAG CTAOCCACCT TGGCGAAATT TTCTOCTTAC CTGGGOCAGA TGATTAATCT 1020 
GCGTAGACTC CnXTCTCCC ACATCCATGC ATCTTCCTAC ATTTOCCCGG AGAAGGAAGA 1080 
GCAGTATATC GCCCAGTTCA CCTCTCAGTT CCTCAGTCTG CAGTGCCTGC AGGCTCTCTA 1140 
TGTGGACTCT TTATTTTTCC TTAGAGGCCG CCTGGATCAG TTGCTCAGGC AOGTGATGAA 1200 
CCCCTTOGAA ACCCTCTCAA TAACTAACTG CCGGCTTTCG GAAGGGGATG TGATGCATCT 1260 
GTCCCAGAGT CCCAGCGTCA GTCAGCTAAG TGTCCTGAGT CTAAGTGGGG TCATGCTGAC 1320 
CXJATGTAAGT CCCGAGCCCC TCCAAGCTCT GCTGGAGAGA GCCTCTGOCA OCCTCCAGGA 1380 
CCTGGTC1TT GATGAGTGTG GGATCACGG A TGATCAGCTC CTTGCGCTCC TGCCTTCOCT 1440 
GAGCCACTGC TOCCAGCTTA CAACCTTAAG CTTCTAOGGG AATTCCATCT CCATATCTGC 1500 
CTTGCAGAGT CTOCTGCAGC ACCTCATCGG GCTGAGCAAT CTGACCCACG TGCTGTATCC 1560 
TUrrUOCOL'lO GAOAGTTATG AGOACATCCA TGGTACCCrC CACCTGGAGA GGCTTGOCTA 1620 
TCTGCATOCC AGGCTCAGGG AGTTGCTGTG TGAGTTGGGO CGGCCCAGCA TGGTCTGGCT 1680 
TAGTGCXAAC CCCTGTCCTC ACTGTGGGGA CAGAACCTTC TATG ACCCG G AGOOCATCCT 1740 
flTGCCCCrGT TTCATGCCTA ACTAGCTGGG TOCACATATC AAATGCTTCA TICft lCATA C 1800 
TTGOACACTA AAGCCAGG AT GTGCATGCAT CTTGAAGCAA CAAAGCAGOC ACAGTTTCAG 1860 
ACAAATGTTC AGTGTGAGTG AGGAAAACAT GTTCAGTGAG GAAAAAACAT TCAGACAAAT 1920 
GTTCAGTGAG GAAAAAAAOO GGAAOTTQOO GATAGGCAOA TGTTGACTTG AGGAGTTAAT 1980 
GTGATCTTTG GGOAGATACA TCTTATAGAG TTAGAAATAG AATCTGAATT TCTAAAGGGA 2040 
GATTCTGGCT TGGGAAGTAC ATGTAGGAGT TAATCCCTGT GTAGACTGTT GTAAAGAAAC 2100 
TGTTGAAAAT AAAGAGAAGC AATGTGAAGC AAAAAAAAAA AAAAAAAA 



SeqIDNO: 110 Protein seacence; 
Ptotem Accession f: NP_0O6 106.1 
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